Dataset statistics
| Number of variables | 29 |
|---|---|
| Number of observations | 1574040 |
| Missing cells | 2185154 |
| Missing cells (%) | 4.8% |
| Duplicate rows | 0 |
| Duplicate rows (%) | 0.0% |
| Total size in memory | 1.8 GiB |
| Average record size in memory | 1.2 KiB |
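The missing-cell percentage and the average record size follow from the other figures in this table. The sketch below reproduces that arithmetic; the variable names are chosen here for illustration, and the memory figure is the rounded 1.8 GiB shown above, so the last digit may differ.

```python
# Figures copied from the "Dataset statistics" table.
n_variables = 29
n_observations = 1_574_040
missing_cells = 2_185_154
total_memory_gib = 1.8  # rounded value as reported

# Every observation contributes one cell per variable.
total_cells = n_variables * n_observations
missing_pct = 100 * missing_cells / total_cells

# Average bytes per record, expressed in KiB.
avg_record_kib = total_memory_gib * 1024**3 / n_observations / 1024

print(f"total cells: {total_cells}")
print(f"missing cells: {missing_pct:.1f}%")
print(f"average record size: {avg_record_kib:.1f} KiB")
```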
Variable types
| Numeric | 12 |
|---|---|
| DateTime | 2 |
| Categorical | 15 |
SECTOR has constant value "PRIVATE" | Constant |
SALARY_TYPE has constant value "1.0" | Constant |
COUNTRY_DESC has a high cardinality: 173 distinct values | High cardinality |
JOB_DESC has a high cardinality: 2285 distinct values | High cardinality |
ECONOMIC_ACT_DESC has a high cardinality: 967 distinct values | High cardinality |
COMPANY_NAME has a high cardinality: 122145 distinct values | High cardinality |
CIVIL_ID is highly correlated with Age | High correlation |
Age is highly correlated with CIVIL_ID | High correlation |
ONR_GVRN_CODE is highly correlated with GOVERNORATE_DESC and 1 other fields | High correlation |
MARITAL_STATUS_DESC is highly correlated with RLGION_DESC and 2 other fields | High correlation |
GENDER_CODE is highly correlated with GENDER_DESC | High correlation |
Age is highly correlated with BIRTH_DATE and 2 other fields | High correlation |
COUNTRY_CODE is highly correlated with RLGION_CODE | High correlation |
BIRTH_DATE is highly correlated with Age and 2 other fields | High correlation |
Age Group is highly correlated with Age and 2 other fields | High correlation |
EDUCATION_DESC is highly correlated with RLGION_DESC and 2 other fields | High correlation |
GOVERNORATE_DESC is highly correlated with ONR_GVRN_CODE and 1 other fields | High correlation |
CIVIL_ID is highly correlated with Age and 2 other fields | High correlation |
RLGION_DESC is highly correlated with MARITAL_STATUS_DESC and 3 other fields | High correlation |
RLGION_CODE is highly correlated with COUNTRY_CODE and 1 other fields | High correlation |
EDUCATION_CODE is highly correlated with MAJOR_CODE | High correlation |
MAJOR_CODE is highly correlated with EDUCATION_DESC and 1 other fields | High correlation |
جنسية is highly correlated with MARITAL_STATUS_DESC and 2 other fields | High correlation |
GENDER_DESC is highly correlated with GENDER_CODE | High correlation |
ONR_ID is highly correlated with ONR_GVRN_CODE and 1 other fields | High correlation |
MARITAL_STATUS_CODE is highly correlated with MARITAL_STATUS_DESC | High correlation |
RLGION_CODE is highly correlated with SECTOR and 1 other fields | High correlation |
MARITAL_STATUS_DESC is highly correlated with SECTOR | High correlation |
GENDER_CODE is highly correlated with GENDER_DESC and 1 other fields | High correlation |
جنسية is highly correlated with SECTOR and 2 other fields | High correlation |
GENDER_DESC is highly correlated with GENDER_CODE and 1 other fields | High correlation |
SECTOR is highly correlated with RLGION_CODE and 8 other fields | High correlation |
Age Group is highly correlated with SECTOR | High correlation |
EDUCATION_DESC is highly correlated with جنسية and 1 other fields | High correlation |
GOVERNORATE_DESC is highly correlated with SECTOR | High correlation |
RLGION_DESC is highly correlated with RLGION_CODE and 2 other fields | High correlation |
RLGION_CODE has 71492 (4.5%) missing values | Missing |
EDUCATION_CODE has 88711 (5.6%) missing values | Missing |
MAJOR_CODE has 88711 (5.6%) missing values | Missing |
SALARY_TYPE has 1574039 (> 99.9%) missing values | Missing |
ADDRESS_AUTO_NO has 354729 (22.5%) missing values | Missing |
ECONOMIC_ACT_CODE is highly skewed (γ1 = 39.20844929) | Skewed |
EDUCATION_CODE is highly skewed (γ1 = 338.9212143) | Skewed |
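Several of the alerts above point at columns that carry little independent information: constant columns (SECTOR, SALARY_TYPE) and code/description pairs such as GENDER_CODE and GENDER_DESC. A minimal sketch of how such columns could be detected is shown below. It assumes rows are available as a list of dictionaries; the helper names and the toy rows are hypothetical and are not taken from the dataset.

```python
from collections import defaultdict


def constant_columns(rows):
    """Return columns that have at most one distinct non-missing value."""
    seen = defaultdict(set)
    for row in rows:
        for col, val in row.items():
            if val is not None:
                seen[col].add(val)
    return sorted(col for col, vals in seen.items() if len(vals) <= 1)


def is_one_to_one(rows, a, b):
    """True if every value of column a pairs with exactly one value of b, and vice versa."""
    a_to_b, b_to_a = defaultdict(set), defaultdict(set)
    for row in rows:
        a_to_b[row[a]].add(row[b])
        b_to_a[row[b]].add(row[a])
    return all(len(v) == 1 for v in a_to_b.values()) and all(
        len(v) == 1 for v in b_to_a.values()
    )


# Hypothetical toy rows for illustration only.
rows = [
    {"SECTOR": "PRIVATE", "SALARY_TYPE": 1.0, "GENDER_CODE": 1.0, "GENDER_DESC": "male"},
    {"SECTOR": "PRIVATE", "SALARY_TYPE": None, "GENDER_CODE": 2.0, "GENDER_DESC": "female"},
    {"SECTOR": "PRIVATE", "SALARY_TYPE": None, "GENDER_CODE": 1.0, "GENDER_DESC": "male"},
]

print(constant_columns(rows))
print(is_one_to_one(rows, "GENDER_CODE", "GENDER_DESC"))
```

Whether a redundant column should actually be dropped depends on the downstream use; the sketch only identifies candidates.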
Reproduction
| Analysis started | 2021-05-26 17:25:21.859082 |
|---|---|
| Analysis finished | 2021-05-26 17:53:20.723521 |
| Duration | 27 minutes and 58.86 seconds |
| Software version | pandas-profiling v3.0.0 |
| Download configuration | config.json |
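The reported duration can be checked against the start and finish timestamps. The sketch below parses both with the standard library and formats the difference the same way as the table; the variable names are illustrative.

```python
from datetime import datetime

FMT = "%Y-%m-%d %H:%M:%S.%f"

# Timestamps copied from the "Reproduction" table.
started = datetime.strptime("2021-05-26 17:25:21.859082", FMT)
finished = datetime.strptime("2021-05-26 17:53:20.723521", FMT)

elapsed = finished - started
minutes, seconds = divmod(elapsed.total_seconds(), 60)

print(f"{int(minutes)} minutes and {seconds:.2f} seconds")
```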
| Distinct | 1574038 |
|---|---|
| Distinct (%) | > 99.9% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 2.80909332 × 10^11 |
| Minimum | 1.780201 × 10^11 |
|---|---|
| Maximum | 5.200400698 × 10^11 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 12.0 MiB |
Quantile statistics
| Minimum | 1.780201 × 10^11 |
|---|---|
| 5-th percentile | 2.620129012 × 10^11 |
| Q1 | 2.740819017 × 10^11 |
| median | 2.821010167 × 10^11 |
| Q3 | 2.890606059 × 10^11 |
| 95-th percentile | 2.950425037 × 10^11 |
| Maximum | 5.200400698 × 10^11 |
| Range | 3.420199698 × 10^11 |
| Interquartile range (IQR) | 1.497870427 × 10^10 |
Descriptive statistics
| Standard deviation | 1.039015273 × 10^10 |
|---|---|
| Coefficient of variation (CV) | 0.03698756698 |
| Kurtosis | 0.3013487943 |
| Mean | 2.80909332 × 10^11 |
| Median Absolute Deviation (MAD) | 7048912426 |
| Skewness | -0.6759298858 |
| Sum | 4.421625249 × 10^17 |
| Variance | 1.079552738 × 10^20 |
| Monotonicity | Not monotonic |
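Several of the statistics above are derived from others: the range from minimum and maximum, the IQR from the quartiles, and the coefficient of variation and variance from the mean and standard deviation. The sketch below recomputes them from the rounded values printed in the tables, so agreement is expected only to the displayed precision.

```python
import math

# Values copied from the quantile and descriptive statistics tables.
minimum, maximum = 1.780201e11, 5.200400698e11
q1, q3 = 2.740819017e11, 2.890606059e11
mean, std = 2.80909332e11, 1.039015273e10

value_range = maximum - minimum  # Range
iqr = q3 - q1                    # Interquartile range
cv = std / mean                  # Coefficient of variation
variance = std ** 2              # Variance

print(f"range    = {value_range:.9e}")
print(f"IQR      = {iqr:.9e}")
print(f"CV       = {cv:.10f}")
print(f"variance = {variance:.9e}")
```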
| Value | Count | Frequency (%) |
| 2.521113001 × 10^11 | 2 | < 0.1% |
| 2.890425046 × 10^11 | 2 | < 0.1% |
| 2.800222058 × 10^11 | 1 | < 0.1% |
| 2.740814062 × 10^11 | 1 | < 0.1% |
| 2.930101066 × 10^11 | 1 | < 0.1% |
| 2.890214048 × 10^11 | 1 | < 0.1% |
| 2.78042011 × 10^11 | 1 | < 0.1% |
| 2.820308003 × 10^11 | 1 | < 0.1% |
| 2.91010136 × 10^11 | 1 | < 0.1% |
| 2.83010158 × 10^11 | 1 | < 0.1% |
| Other values (1574028) | 1574028 |
| Value | Count | Frequency (%) |
| 1.780201 × 10^11 | 1 | |
| 1.891108 × 10^11 | 1 | |
| 1.981207 × 10^11 | 1 | |
| 2.080805 × 10^11 | 1 | |
| 2.150204001 × 10^11 | 1 | |
| 2.220301001 × 10^11 | 1 | |
| 2.220515002 × 10^11 | 1 | |
| 2.221222001 × 10^11 | 1 | |
| 2.230701001 × 10^11 | 1 | |
| 2.231208002 × 10^11 | 1 | |
| Value | Count | Frequency (%) |
| 5.200400698 × 10^11 | 1 | |
| 3.140923029 × 10^11 | 1 | |
| 3.130306023 × 10^11 | 1 | |
| 3.031215015 × 10^11 | 1 | |
| 3.021114013 × 10^11 | 1 | |
| 3.021105012 × 10^11 | 1 | |
| 3.021102008 × 10^11 | 1 | |
| 3.020928016 × 10^11 | 1 | |
| 3.020916012 × 10^11 | 1 | |
| 3.020815011 × 10^11 | 1 | |
| Distinct | 97 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 500 |
| Missing (%) | < 0.1% |
| Memory size | 12.0 MiB |
| Minimum | 1878-01-01 00:00:00 |
|---|---|
| Maximum | 2049-01-01 00:00:00 |
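The maximum of 2049-01-01 lies after the analysis date (2021-05-26), and the minimum of 1878-01-01 is more than 140 years before it. If this variable is meant to hold past dates only, values outside a plausibility window could be flagged as sketched below. The lower bound and the function name are assumptions made for illustration, not properties of the dataset.

```python
from datetime import date

ANALYSIS_DATE = date(2021, 5, 26)      # day the report was generated
EARLIEST_PLAUSIBLE = date(1900, 1, 1)  # assumed lower bound, adjust as needed


def classify(d):
    """Label a date as missing, future, too_early, or ok."""
    if d is None:
        return "missing"
    if d > ANALYSIS_DATE:
        return "future"
    if d < EARLIEST_PLAUSIBLE:
        return "too_early"
    return "ok"


for sample in (date(1878, 1, 1), date(2049, 1, 1), date(1980, 1, 1), None):
    print(sample, classify(sample))
```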
COUNTRY_CODE
| Distinct | 173 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 459.0476214 |
| Minimum | 0 |
|---|---|
| Maximum | 883 |
| Zeros | 9 |
| Zeros (%) | < 0.1% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 12.0 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 106 |
| Q1 | 107 |
| median | 702 |
| Q3 | 709 |
| 95-th percentile | 722 |
| Maximum | 883 |
| Range | 883 |
| Interquartile range (IQR) | 602 |
Descriptive statistics
| Standard deviation | 297.7551975 |
|---|---|
| Coefficient of variation (CV) | 0.6486368378 |
| Kurtosis | -1.878648097 |
| Mean | 459.0476214 |
| Median Absolute Deviation (MAD) | 19 |
| Skewness | -0.3207666481 |
| Sum | 722559318 |
| Variance | 88658.15766 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 709 | 477043 | 30.3% |
| 107 | 447370 | 28.4% |
| 702 | 169610 | 10.8% |
| 721 | 72319 | 4.6% |
| 101 | 71308 | 4.5% |
| 722 | 67942 | 4.3% |
| 110 | 57882 | 3.7% |
| 720 | 47438 | 3.0% |
| 106 | 22864 | 1.5% |
| 711 | 20522 | 1.3% |
| Other values (163) | 119742 | 7.6% |
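Each frequency in this table is the count divided by the number of observations (1574040). The sketch below reproduces the percentages for the three most common values; the dictionary layout is chosen here for illustration.

```python
n_observations = 1_574_040

# Counts copied from the table, keyed by value.
counts = {709: 477_043, 107: 447_370, 702: 169_610}

# Percentage of all observations, rounded to one decimal as in the report.
freq_pct = {value: round(100 * count / n_observations, 1) for value, count in counts.items()}

for value, pct in freq_pct.items():
    print(f"{value}: {pct}%")
```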
| Value | Count | Frequency (%) |
| 0 | 9 | < 0.1% |
| 101 | 71308 | 4.5% |
| 103 | 984 | 0.1% |
| 104 | 3218 | 0.2% |
| 105 | 21 | < 0.1% |
| 106 | 22864 | 1.5% |
| 107 | 447370 | 28.4% |
| 108 | 4451 | 0.3% |
| 110 | 57882 | 3.7% |
| 111 | 18862 | 1.2% |
| Value | Count | Frequency (%) |
| 883 | 2 | < 0.1% |
| 882 | 7 | < 0.1% |
| 881 | 41 | < 0.1% |
| 880 | 43 | < 0.1% |
| 870 | 30 | < 0.1% |
| 860 | 795 | |
| 850 | 25 | < 0.1% |
| 839 | 94 | < 0.1% |
| 838 | 152 | < 0.1% |
| 837 | 4 | < 0.1% |
COUNTRY_DESC
| Distinct | 173 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 143.8 MiB |
| الهنــد | |
|---|---|
| مصـــر | |
| بنجلاديش | |
| باكستان | |
| الكويت | |
| Other values (168) |
Length
| Max length | 26 |
|---|---|
| Median length | 7 |
| Mean length | 6.891821046 |
| Min length | 0 |
Characters and Unicode
| Total characters | 10848002 |
|---|---|
| Distinct characters | 34 |
| Distinct categories | 4 |
| Distinct scripts | 2 |
| Distinct blocks | 2 |
Unique
| Unique | 15 |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | الكويت |
|---|---|
| 2nd row | باكستان |
| 3rd row | مصـــر |
| 4th row | مصـــر |
| 5th row | مصـــر |
Common Values
| Value | Count | Frequency (%) |
| الهنــد | 477043 | 30.3% |
| مصـــر | 447370 | 28.4% |
| بنجلاديش | 169610 | 10.8% |
| باكستان | 72319 | 4.6% |
| الكويت | 71308 | 4.5% |
| الفلبين | 67942 | 4.3% |
| ســوريا | 57882 | 3.7% |
| نيبال | 47438 | 3.0% |
| الأردن | 22864 | 1.5% |
| ايــران | 20522 | 1.3% |
| Other values (163) | 119742 | 7.6% |
Length (histogram not shown)
Words
| Value | Count | Frequency (%) |
| الهنــد | 477043 | |
| مصـــر | 447370 | |
| بنجلاديش | 169610 | 10.4% |
| باكستان | 72319 | 4.4% |
| الكويت | 71308 | 4.4% |
| الفلبين | 67942 | 4.2% |
| ســوريا | 57882 | 3.6% |
| نيبال | 47438 | 2.9% |
| الأردن | 22864 | 1.4% |
| ايــران | 20522 | 1.3% |
| Other values (183) | 175834 | 10.8% |
Most occurring characters
| Value | Count | Frequency (%) |
| ـ | 2518286 | |
| ا | 1311399 | |
| ل | 1052841 | |
| ن | 1019183 | |
| د | 694604 | 6.4% |
| ر | 593713 | 5.5% |
| ي | 555628 | 5.1% |
| ه | 486954 | 4.5% |
| م | 475269 | 4.4% |
| ص | 453742 | 4.2% |
| Other values (24) | 1686383 |
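The most frequent character, "ـ", is the Arabic tatweel (U+0640), a stretching character that pads labels such as "مصـــر" without changing their meaning. If these labels are later used as join or grouping keys, removing the padding may help avoid spurious mismatches. The sketch below is one possible normalization; the function name is hypothetical, and whether stripping is appropriate depends on the downstream use.

```python
import unicodedata

TATWEEL = "\u0640"  # ARABIC TATWEEL (kashida)


def normalize_label(text: str) -> str:
    """Remove tatweel padding and surrounding whitespace from a label."""
    return text.replace(TATWEEL, "").strip()


# "مصـــر" written with escapes so the three tatweel characters are explicit.
padded = "\u0645\u0635\u0640\u0640\u0640\u0631"
cleaned = normalize_label(padded)

print(unicodedata.category(TATWEEL))  # general category of the tatweel
print(len(padded), len(cleaned))
```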
Most occurring categories
| Value | Count | Frequency (%) |
| Other Letter | 8273601 | |
| Modifier Letter | 2518286 | 23.2% |
| Space Separator | 56114 | 0.5% |
| Dash Punctuation | 1 | < 0.1% |
Most frequent character per category
Other Letter
| Value | Count | Frequency (%) |
| ا | 1311399 | |
| ل | 1052841 | |
| ن | 1019183 | |
| د | 694604 | |
| ر | 593713 | |
| ي | 555628 | |
| ه | 486954 | 5.9% |
| م | 475269 | 5.7% |
| ص | 453742 | 5.5% |
| ب | 392287 | 4.7% |
| Other values (21) | 1237981 |
Modifier Letter
| Value | Count | Frequency (%) |
| ـ | 2518286 |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 56114 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Arabic | 8273601 | |
| Common | 2574401 | 23.7% |
Most frequent character per script
Arabic
| Value | Count | Frequency (%) |
| ا | 1311399 | |
| ل | 1052841 | |
| ن | 1019183 | |
| د | 694604 | |
| ر | 593713 | |
| ي | 555628 | |
| ه | 486954 | 5.9% |
| م | 475269 | 5.7% |
| ص | 453742 | 5.5% |
| ب | 392287 | 4.7% |
| Other values (21) | 1237981 |
Common
| Value | Count | Frequency (%) |
| ـ | 2518286 | |
| (space) | 56114 | 2.2% |
| - | 1 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| Arabic | 10791887 | |
| ASCII | 56115 | 0.5% |
Most frequent character per block
Arabic
| Value | Count | Frequency (%) |
| ـ | 2518286 | |
| ا | 1311399 | |
| ل | 1052841 | |
| ن | 1019183 | |
| د | 694604 | 6.4% |
| ر | 593713 | 5.5% |
| ي | 555628 | 5.1% |
| ه | 486954 | 4.5% |
| م | 475269 | 4.4% |
| ص | 453742 | 4.2% |
| Other values (22) | 1630268 |
ASCII
| Value | Count | Frequency (%) |
| (space) | 56114 | |
| - | 1 | < 0.1% |
GENDER_CODE
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 78 |
| Missing (%) | < 0.1% |
| Memory size | 90.1 MiB |
| 1.0 | |
|---|---|
| 2.0 |
Length
| Max length | 3 |
|---|---|
| Median length | 3 |
| Mean length | 3 |
| Min length | 3 |
Characters and Unicode
| Total characters | 4721886 |
|---|---|
| Distinct characters | 4 |
| Distinct categories | 2 |
| Distinct scripts | 1 |
| Distinct blocks | 1 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 2.0 |
|---|---|
| 2nd row | 1.0 |
| 3rd row | 1.0 |
| 4th row | 1.0 |
| 5th row | 1.0 |
Common Values
| Value | Count | Frequency (%) |
| 1.0 | 1406082 | |
| 2.0 | 167880 | 10.7% |
| (Missing) | 78 | < 0.1% |
Length (histogram not shown)
Pie chart (not shown)
Words
| Value | Count | Frequency (%) |
| 1.0 | 1406082 | |
| 2.0 | 167880 | 10.7% |
Most occurring characters
| Value | Count | Frequency (%) |
| . | 1573962 | |
| 0 | 1573962 | |
| 1 | 1406082 | |
| 2 | 167880 | 3.6% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 3147924 | |
| Other Punctuation | 1573962 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 1573962 | |
| 1 | 1406082 | |
| 2 | 167880 | 5.3% |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 1573962 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 4721886 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| . | 1573962 | |
| 0 | 1573962 | |
| 1 | 1406082 | |
| 2 | 167880 | 3.6% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 4721886 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| . | 1573962 | |
| 0 | 1573962 | |
| 1 | 1406082 | |
| 2 | 167880 | 3.6% |
GENDER_DESC
| Distinct | 3 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 132.4 MiB |
| ذكر | |
|---|---|
| انثى | |
| (empty string) | 78 |
Length
| Max length | 4 |
|---|---|
| Median length | 3 |
| Mean length | 3.106506823 |
| Min length | 0 |
Characters and Unicode
| Total characters | 4889766 |
|---|---|
| Distinct characters | 7 |
| Distinct categories | 1 |
| Distinct scripts | 1 |
| Distinct blocks | 1 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | انثى |
|---|---|
| 2nd row | ذكر |
| 3rd row | ذكر |
| 4th row | ذكر |
| 5th row | ذكر |
Common Values
| Value | Count | Frequency (%) |
| ذكر | 1406082 | |
| انثى | 167880 | 10.7% |
| (empty string) | 78 | < 0.1% |
Length (histogram not shown)
Pie chart (not shown)
Words
| Value | Count | Frequency (%) |
| ذكر | 1406082 | |
| انثى | 167880 | 10.7% |
Most occurring characters
| Value | Count | Frequency (%) |
| ذ | 1406082 | |
| ك | 1406082 | |
| ر | 1406082 | |
| ا | 167880 | 3.4% |
| ن | 167880 | 3.4% |
| ث | 167880 | 3.4% |
| ى | 167880 | 3.4% |
Most occurring categories
| Value | Count | Frequency (%) |
| Other Letter | 4889766 |
Most frequent character per category
Other Letter
| Value | Count | Frequency (%) |
| ذ | 1406082 | |
| ك | 1406082 | |
| ر | 1406082 | |
| ا | 167880 | 3.4% |
| ن | 167880 | 3.4% |
| ث | 167880 | 3.4% |
| ى | 167880 | 3.4% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Arabic | 4889766 |
Most frequent character per script
Arabic
| Value | Count | Frequency (%) |
| ذ | 1406082 | |
| ك | 1406082 | |
| ر | 1406082 | |
| ا | 167880 | 3.4% |
| ن | 167880 | 3.4% |
| ث | 167880 | 3.4% |
| ى | 167880 | 3.4% |
Most occurring blocks
| Value | Count | Frequency (%) |
| Arabic | 4889766 |
Most frequent character per block
Arabic
| Value | Count | Frequency (%) |
| ذ | 1406082 | |
| ك | 1406082 | |
| ر | 1406082 | |
| ا | 167880 | 3.4% |
| ن | 167880 | 3.4% |
| ث | 167880 | 3.4% |
| ى | 167880 | 3.4% |
RLGION_CODE
| Distinct | 5 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 71492 |
| Missing (%) | 4.5% |
| Memory size | 88.7 MiB |
| 1.0 | |
|---|---|
| 2.0 | |
| 0.0 | |
| 3.0 | 41858 |
| 4.0 | 1411 |
Length
| Max length | 3 |
|---|---|
| Median length | 3 |
| Mean length | 3 |
| Min length | 3 |
Characters and Unicode
| Total characters | 4507644 |
|---|---|
| Distinct characters | 6 |
| Distinct categories | 2 |
| Distinct scripts | 1 |
| Distinct blocks | 1 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 1.0 |
|---|---|
| 2nd row | 1.0 |
| 3rd row | 1.0 |
| 4th row | 1.0 |
| 5th row | 1.0 |
Common Values
| Value | Count | Frequency (%) |
| 1.0 | 1005174 | |
| 2.0 | 235153 | 14.9% |
| 0.0 | 218952 | 13.9% |
| 3.0 | 41858 | 2.7% |
| 4.0 | 1411 | 0.1% |
| (Missing) | 71492 | 4.5% |
Length (histogram not shown)
Pie chart (not shown)
Words
| Value | Count | Frequency (%) |
| 1.0 | 1005174 | |
| 2.0 | 235153 | 15.7% |
| 0.0 | 218952 | 14.6% |
| 3.0 | 41858 | 2.8% |
| 4.0 | 1411 | 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 1721500 | |
| . | 1502548 | |
| 1 | 1005174 | |
| 2 | 235153 | 5.2% |
| 3 | 41858 | 0.9% |
| 4 | 1411 | < 0.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 3005096 | |
| Other Punctuation | 1502548 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 1721500 | |
| 1 | 1005174 | |
| 2 | 235153 | 7.8% |
| 3 | 41858 | 1.4% |
| 4 | 1411 | < 0.1% |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 1502548 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 4507644 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 0 | 1721500 | |
| . | 1502548 | |
| 1 | 1005174 | |
| 2 | 235153 | 5.2% |
| 3 | 41858 | 0.9% |
| 4 | 1411 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 4507644 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 1721500 | |
| . | 1502548 | |
| 1 | 1005174 | |
| 2 | 235153 | 5.2% |
| 3 | 41858 | 0.9% |
| 4 | 1411 | < 0.1% |
RLGION_DESC
| Distinct | 6 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 136.5 MiB |
| مسلم | |
|---|---|
| مسيحي | |
| ديانات أخري | |
| (empty string) | 71492 |
| هندوسي | 41858 |
Length
| Max length | 11 |
|---|---|
| Median length | 4 |
| Mean length | 4.994615766 |
| Min length | 0 |
Characters and Unicode
| Total characters | 7861725 |
|---|---|
| Distinct characters | 17 |
| Distinct categories | 2 |
| Distinct scripts | 2 |
| Distinct blocks | 2 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | |
|---|---|
| 2nd row | مسلم |
| 3rd row | مسلم |
| 4th row | مسلم |
| 5th row | مسلم |
Common Values
| Value | Count | Frequency (%) |
| مسلم | 1005174 | |
| مسيحي | 235153 | 14.9% |
| ديانات أخري | 218952 | 13.9% |
| (empty string) | 71492 | 4.5% |
| هندوسي | 41858 | 2.7% |
| بوذي | 1411 | 0.1% |
Length (histogram not shown)
Pie chart (not shown)
Words
| Value | Count | Frequency (%) |
| مسلم | 1005174 | |
| مسيحي | 235153 | 13.7% |
| ديانات | 218952 | 12.7% |
| أخري | 218952 | 12.7% |
| هندوسي | 41858 | 2.4% |
| بوذي | 1411 | 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| م | 2245501 | |
| س | 1282185 | |
| ل | 1005174 | |
| ي | 951479 | |
| ا | 437904 | 5.6% |
| د | 260810 | 3.3% |
| ن | 260810 | 3.3% |
| ح | 235153 | 3.0% |
| ت | 218952 | 2.8% |
| (space) | 218952 | 2.8% |
| Other values (7) | 744805 | 9.5% |
Most occurring categories
| Value | Count | Frequency (%) |
| Other Letter | 7642773 | |
| Space Separator | 218952 | 2.8% |
Most frequent character per category
Other Letter
| Value | Count | Frequency (%) |
| م | 2245501 | |
| س | 1282185 | |
| ل | 1005174 | |
| ي | 951479 | |
| ا | 437904 | 5.7% |
| د | 260810 | 3.4% |
| ن | 260810 | 3.4% |
| ح | 235153 | 3.1% |
| ت | 218952 | 2.9% |
| أ | 218952 | 2.9% |
| Other values (6) | 525853 | 6.9% |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 218952 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Arabic | 7642773 | |
| Common | 218952 | 2.8% |
Most frequent character per script
Arabic
| Value | Count | Frequency (%) |
| م | 2245501 | |
| س | 1282185 | |
| ل | 1005174 | |
| ي | 951479 | |
| ا | 437904 | 5.7% |
| د | 260810 | 3.4% |
| ن | 260810 | 3.4% |
| ح | 235153 | 3.1% |
| ت | 218952 | 2.9% |
| أ | 218952 | 2.9% |
| Other values (6) | 525853 | 6.9% |
Common
| Value | Count | Frequency (%) |
| (space) | 218952 |
Most occurring blocks
| Value | Count | Frequency (%) |
| Arabic | 7642773 | |
| ASCII | 218952 | 2.8% |
Most frequent character per block
Arabic
| Value | Count | Frequency (%) |
| م | 2245501 | |
| س | 1282185 | |
| ل | 1005174 | |
| ي | 951479 | |
| ا | 437904 | 5.7% |
| د | 260810 | 3.4% |
| ن | 260810 | 3.4% |
| ح | 235153 | 3.1% |
| ت | 218952 | 2.9% |
| أ | 218952 | 2.9% |
| Other values (6) | 525853 | 6.9% |
ASCII
| Value | Count | Frequency (%) |
| (space) | 218952 |
JOB_CODE
Real number (ℝ≥0)
| Distinct | 2297 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 66553.96383 |
| Minimum | 0 |
|---|---|
| Maximum | 3631153 |
| Zeros | 12 |
| Zeros (%) | < 0.1% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 12.0 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 6582 |
| Q1 | 43333 |
| median | 62450 |
| Q3 | 98515 |
| 95-th percentile | 99410 |
| Maximum | 3631153 |
| Range | 3631153 |
| Interquartile range (IQR) | 55182 |
Descriptive statistics
| Standard deviation | 33072.72806 |
|---|---|
| Coefficient of variation (CV) | 0.4969310039 |
| Kurtosis | 829.6390326 |
| Mean | 66553.96383 |
| Median Absolute Deviation (MAD) | 32540 |
| Skewness | 7.780544805 |
| Sum | 1.047586012 × 10^11 |
| Variance | 1093805341 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 99320 | 164555 | 10.5% |
| 98515 | 107114 | 6.8% |
| 45290 | 99493 | 6.3% |
| 55220 | 63878 | 4.1% |
| 62190 | 39552 | 2.5% |
| 93990 | 32018 | 2.0% |
| 99890 | 27331 | 1.7% |
| 98630 | 27231 | 1.7% |
| 98565 | 23128 | 1.5% |
| 53250 | 20053 | 1.3% |
| Other values (2287) | 969687 |
| Value | Count | Frequency (%) |
| 0 | 12 | < 0.1% |
| 66 | 1 | < 0.1% |
| 100 | 241 | |
| 1110 | 14 | < 0.1% |
| 1120 | 2 | < 0.1% |
| 1140 | 1 | < 0.1% |
| 1190 | 281 | |
| 1191 | 1 | < 0.1% |
| 1192 | 2 | < 0.1% |
| 1193 | 2 | < 0.1% |
| Value | Count | Frequency (%) |
| 3631153 | 9 | < 0.1% |
| 2212414 | 1 | < 0.1% |
| 2212164 | 4 | < 0.1% |
| 1431013 | 1 | < 0.1% |
| 436103 | 35 | < 0.1% |
| 436100 | 99 | < 0.1% |
| 99982 | 379 | < 0.1% |
| 99981 | 598 | < 0.1% |
| 99980 | 139 | < 0.1% |
| 99970 | 1680 |
JOB_DESC
| Distinct | 2285 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 152.9 MiB |
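JOB_CODE has 2297 distinct values, while the description column has 2285. If every code carries a single description, at least some descriptions must be shared by more than one code. The sketch below shows one way to list such descriptions; the function name and the toy pairs are hypothetical and are not taken from the dataset.

```python
from collections import defaultdict


def descriptions_with_multiple_codes(pairs):
    """Map each description to its codes and keep those with more than one code."""
    codes_by_desc = defaultdict(set)
    for code, desc in pairs:
        codes_by_desc[desc].add(code)
    return {desc: sorted(codes) for desc, codes in codes_by_desc.items() if len(codes) > 1}


# Hypothetical (code, description) pairs for illustration only.
pairs = [(101, "driver"), (102, "driver"), (200, "seller"), (200, "seller")]

dupes = descriptions_with_multiple_codes(pairs)
print(dupes)
```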
| عامل عادى خفيف | |
|---|---|
| سائق مركبه خفيفه | 107114 |
| بائع | 99493 |
| عامل نظافة | 63878 |
| عامل زراعى | 39552 |
| Other values (2280) |
Length
| Max length | 49 |
|---|---|
| Median length | 10 |
| Mean length | 9.950757287 |
| Min length | 0 |
Characters and Unicode
| Total characters | 15662890 |
|---|---|
| Distinct characters | 40 |
| Distinct categories | 5 |
| Distinct scripts | 2 |
| Distinct blocks | 2 |
Unique
| Unique | 271 |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | مسئول |
|---|---|
| 2nd row | حداد |
| 3rd row | نقاش |
| 4th row | نقاش |
| 5th row | فنى كهربائي |
Common Values
| Value | Count | Frequency (%) |
| عامل عادى خفيف | 164555 | 10.5% |
| سائق مركبه خفيفه | 107114 | 6.8% |
| بائع | 99493 | 6.3% |
| عامل نظافة | 63878 | 4.1% |
| عامل زراعى | 39552 | 2.5% |
| عامل انتاج | 32018 | 2.0% |
| عامل فنى | 27331 | 1.7% |
| سائق معدات ثقيلة | 27231 | 1.7% |
| سائق شاحنة | 23128 | 1.5% |
| جارسون | 20053 | 1.3% |
| Other values (2275) | 969687 |
Length (histogram not shown)
Words
| Value | Count | Frequency (%) |
| عامل | 415522 | 13.1% |
| سائق | 209989 | 6.6% |
| خفيف | 168466 | 5.3% |
| عادى | 165901 | 5.2% |
| فنى | 131392 | 4.1% |
| مركبه | 110001 | 3.5% |
| خفيفه | 107114 | 3.4% |
| بائع | 103313 | 3.3% |
| نظافة | 63997 | 2.0% |
| مدير | 50289 | 1.6% |
| Other values (1458) | 1652698 |
Most occurring characters
| Value | Count | Frequency (%) |
| ا | 2080758 | 13.3% |
| (space) | 1605594 | 10.3% |
| م | 1427625 | 9.1% |
| ع | 1006742 | 6.4% |
| ف | 887638 | 5.7% |
| ي | 768011 | 4.9% |
| ل | 763612 | 4.9% |
| ر | 694398 | 4.4% |
| ن | 593996 | 3.8% |
| ب | 587282 | 3.7% |
| Other values (30) | 5247234 |
Most occurring categories
| Value | Count | Frequency (%) |
| Other Letter | 14050278 | |
| Space Separator | 1605594 | 10.3% |
| Other Punctuation | 7010 | < 0.1% |
| Open Punctuation | 4 | < 0.1% |
| Close Punctuation | 4 | < 0.1% |
Most frequent character per category
Other Letter
| Value | Count | Frequency (%) |
| ا | 2080758 | |
| م | 1427625 | 10.2% |
| ع | 1006742 | 7.2% |
| ف | 887638 | 6.3% |
| ي | 768011 | 5.5% |
| ل | 763612 | 5.4% |
| ر | 694398 | 4.9% |
| ن | 593996 | 4.2% |
| ب | 587282 | 4.2% |
| د | 537296 | 3.8% |
| Other values (26) | 4702920 |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 1605594 |
Other Punctuation
| Value | Count | Frequency (%) |
| / | 7010 |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 4 |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 4 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Arabic | 14050278 | |
| Common | 1612612 | 10.3% |
Most frequent character per script
Arabic
| Value | Count | Frequency (%) |
| ا | 2080758 | |
| م | 1427625 | 10.2% |
| ع | 1006742 | 7.2% |
| ف | 887638 | 6.3% |
| ي | 768011 | 5.5% |
| ل | 763612 | 5.4% |
| ر | 694398 | 4.9% |
| ن | 593996 | 4.2% |
| ب | 587282 | 4.2% |
| د | 537296 | 3.8% |
| Other values (26) | 4702920 |
Common
| Value | Count | Frequency (%) |
| (space) | 1605594 | |
| / | 7010 | 0.4% |
| ( | 4 | < 0.1% |
| ) | 4 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| Arabic | 14050278 | |
| ASCII | 1612612 | 10.3% |
Most frequent character per block
Arabic
| Value | Count | Frequency (%) |
| ا | 2080758 | |
| م | 1427625 | 10.2% |
| ع | 1006742 | 7.2% |
| ف | 887638 | 6.3% |
| ي | 768011 | 5.5% |
| ل | 763612 | 5.4% |
| ر | 694398 | 4.9% |
| ن | 593996 | 4.2% |
| ب | 587282 | 4.2% |
| د | 537296 | 3.8% |
| Other values (26) | 4702920 |
ASCII
| Value | Count | Frequency (%) |
| (space) | 1605594 | |
| / | 7010 | 0.4% |
| ( | 4 | < 0.1% |
| ) | 4 | < 0.1% |
SECTOR
| Distinct | 1 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 96.1 MiB |
| PRIVATE |
|---|
Length
| Max length | 7 |
|---|---|
| Median length | 7 |
| Mean length | 7 |
| Min length | 7 |
Characters and Unicode
| Total characters | 11018280 |
|---|---|
| Distinct characters | 7 |
| Distinct categories | 1 |
| Distinct scripts | 1 |
| Distinct blocks | 1 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | PRIVATE |
|---|---|
| 2nd row | PRIVATE |
| 3rd row | PRIVATE |
| 4th row | PRIVATE |
| 5th row | PRIVATE |
Common Values
| Value | Count | Frequency (%) |
| PRIVATE | 1574040 |
Length (histogram not shown)
Pie chart (not shown)
Words
| Value | Count | Frequency (%) |
| private | 1574040 |
Most occurring characters
| Value | Count | Frequency (%) |
| P | 1574040 | |
| R | 1574040 | |
| I | 1574040 | |
| V | 1574040 | |
| A | 1574040 | |
| T | 1574040 | |
| E | 1574040 |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 11018280 |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| P | 1574040 | |
| R | 1574040 | |
| I | 1574040 | |
| V | 1574040 | |
| A | 1574040 | |
| T | 1574040 | |
| E | 1574040 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 11018280 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| P | 1574040 | |
| R | 1574040 | |
| I | 1574040 | |
| V | 1574040 | |
| A | 1574040 | |
| T | 1574040 | |
| E | 1574040 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 11018280 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| P | 1574040 | |
| R | 1574040 | |
| I | 1574040 | |
| V | 1574040 | |
| A | 1574040 | |
| T | 1574040 | |
| E | 1574040 |
ECONOMIC_ACT_CODE
| Distinct | 981 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 5764 |
| Missing (%) | 0.4% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 60537.89821 |
| Minimum | 8 |
|---|---|
| Maximum | 3720003 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 12.0 MiB |
Quantile statistics
| Minimum | 8 |
|---|---|
| 5-th percentile | 11045 |
| Q1 | 51201 |
| median | 61246 |
| Q3 | 71141 |
| 95-th percentile | 95201 |
| Maximum | 3720003 |
| Range | 3719995 |
| Interquartile range (IQR) | 19940 |
Descriptive statistics
| Standard deviation | 33119.90792 |
|---|---|
| Coefficient of variation (CV) | 0.547093786 |
| Kurtosis | 3926.438931 |
| Mean | 60537.89821 |
| Median Absolute Deviation (MAD) | 9924 |
| Skewness | 39.20844929 |
| Sum | 9.494013285 × 10^10 |
| Variance | 1096928301 |
| Monotonicity | Not monotonic |
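A skewness of 39.2 alongside an IQR of 19940 and a maximum of 3720003 suggests that a small number of very large codes dominate the third moment. The sketch below illustrates the effect on hypothetical toy data using the plain moment-based estimator g1 = m3 / m2^1.5. The report itself may use a bias-adjusted variant, so the numbers are not expected to match the table.

```python
def skewness(values):
    """Moment-based skewness g1 = m3 / m2**1.5 (no bias adjustment)."""
    n = len(values)
    mean = sum(values) / n
    m2 = sum((x - mean) ** 2 for x in values) / n
    m3 = sum((x - mean) ** 3 for x in values) / n
    return m3 / m2 ** 1.5


# Hypothetical codes, symmetric around 60000, so skewness is zero.
base = [50_000, 60_000, 60_000, 70_000] * 250

# The same data plus a single value equal to the reported maximum.
with_outlier = base + [3_720_003]

print(f"symmetric data: {skewness(base):.3f}")
print(f"with one outlier: {skewness(with_outlier):.3f}")
```

Because these are categorical codes stored as numbers, the skewness mainly reflects the coding scheme rather than any property of the underlying population.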
| Value | Count | Frequency (%) |
| 61244 | 173082 | 11.0% |
| 61246 | 82254 | 5.2% |
| 63102 | 64775 | 4.1% |
| 61245 | 58902 | 3.7% |
| 51100 | 43661 | 2.8% |
| 62159 | 37392 | 2.4% |
| 71141 | 36597 | 2.3% |
| 92007 | 30410 | 1.9% |
| 11037 | 23693 | 1.5% |
| 51201 | 22449 | 1.4% |
| Other values (971) | 995061 |
| Value | Count | Frequency (%) |
| 8 | 1 | < 0.1% |
| 32 | 17 | < 0.1% |
| 51 | 27 | < 0.1% |
| 63 | 6 | < 0.1% |
| 73 | 5 | < 0.1% |
| 82 | 5 | < 0.1% |
| 83 | 11 | < 0.1% |
| 92 | 1138 | |
| 94 | 1 | < 0.1% |
| 112 | 56 | < 0.1% |
| Value | Count | Frequency (%) |
| 3720003 | 40 | < 0.1% |
| 931030 | 3 | < 0.1% |
| 931004 | 9 | < 0.1% |
| 931002 | 374 | < 0.1% |
| 931001 | 5 | < 0.1% |
| 854108 | 33 | < 0.1% |
| 391012 | 56 | < 0.1% |
| 391011 | 138 | < 0.1% |
| 390001 | 14 | < 0.1% |
| 111141 | 20888 |
ECONOMIC_ACT_DESC
| Distinct | 967 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 204.2 MiB |
| التجارة العامة و المقاولات | |
|---|---|
| الادارة العامة ( ادارة الشركات ) | 82254 |
| المطاعم | 64775 |
| التجارة العامة | 58902 |
| المقاولات العامة للمباني | 43664 |
| Other values (962) |
Length
| Max length | 104 |
|---|---|
| Median length | 26 |
| Mean length | 27.04982275 |
| Min length | 0 |
Characters and Unicode
| Total characters | 42577503 |
|---|---|
| Distinct characters | 44 |
| Distinct categories | 6 |
| Distinct scripts | 2 |
| Distinct blocks | 2 |
Unique
| Unique | 23 |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | التجارة العامة و المقاولات |
|---|---|
| 2nd row | التجارة العامة و المقاولات |
| 3rd row | التجارة العامة و المقاولات |
| 4th row | التجارة العامة و المقاولات |
| 5th row | مقاولات انشاءات كهربائية وميكانيكية مثل محطات توليد الكهرباء |
Common Values
| Value | Count | Frequency (%) |
| التجارة العامة و المقاولات | 173082 | 11.0% |
| الادارة العامة ( ادارة الشركات ) | 82254 | 5.2% |
| المطاعم | 64775 | 4.1% |
| التجارة العامة | 58902 | 3.7% |
| المقاولات العامة للمباني | 43664 | 2.8% |
| الاسواق المركزية | 37392 | 2.4% |
| نقل البضائع داخل الكويت | 36597 | 2.3% |
| مقاولات تنظيف المبانى و الشوارع | 30410 | 1.9% |
| اعمال هندسية وتوريد وانشاءات | 23693 | 1.5% |
| مقاولات انشاء ورصف الطرق والشوارع وغيرها | 22449 | 1.4% |
| Other values (957) | 1000822 |
Length (histogram not shown)
Words
| Value | Count | Frequency (%) |
| العامة | 371632 | 5.8% |
| و | 334939 | 5.2% |
| 242675 | 3.8% | |
| التجارة | 232109 | 3.6% |
| المقاولات | 217400 | 3.4% |
| تجارة | 150639 | 2.4% |
| مقاولات | 135790 | 2.1% |
| ادارة | 90802 | 1.4% |
| الشركات | 82362 | 1.3% |
| الادارة | 82254 | 1.3% |
| Other values (1868) | 4467693 |
Most occurring characters
| Value | Count | Frequency (%) |
| ا | 8960727 | |
| (space) | 4853698 | |
| ل | 4631080 | |
| ت | 2363159 | 5.6% |
| م | 2287871 | 5.4% |
| و | 2193804 | 5.2% |
| ة | 2064114 | 4.8% |
| ر | 1849187 | 4.3% |
| ي | 1781275 | 4.2% |
| ن | 1179066 | 2.8% |
| Other values (34) | 10413522 |
Most occurring categories
| Value | Count | Frequency (%) |
| Other Letter | 37400018 | |
| Space Separator | 4853698 | 11.4% |
| Open Punctuation | 152739 | 0.4% |
| Close Punctuation | 148328 | 0.3% |
| Other Punctuation | 16556 | < 0.1% |
| Dash Punctuation | 6164 | < 0.1% |
Most frequent character per category
Other Letter
| Value | Count | Frequency (%) |
| ا | 8960727 | |
| ل | 4631080 | |
| ت | 2363159 | 6.3% |
| م | 2287871 | 6.1% |
| و | 2193804 | 5.9% |
| ة | 2064114 | 5.5% |
| ر | 1849187 | 4.9% |
| ي | 1781275 | 4.8% |
| ن | 1179066 | 3.2% |
| ع | 1138231 | 3.0% |
| Other values (26) | 8951504 |
Other Punctuation
| Value | Count | Frequency (%) |
| ? | 15912 | |
| . | 572 | 3.5% |
| , | 68 | 0.4% |
| / | 4 | < 0.1% |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 4853698 |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 152739 |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 148328 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 6164 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Arabic | 37400018 | 87.8% |
| Common | 5177485 | 12.2% |
Most frequent character per script
Arabic
| Value | Count | Frequency (%) |
| ا | 8960727 | 24.0% |
| ل | 4631080 | 12.4% |
| ت | 2363159 | 6.3% |
| م | 2287871 | 6.1% |
| و | 2193804 | 5.9% |
| ة | 2064114 | 5.5% |
| ر | 1849187 | 4.9% |
| ي | 1781275 | 4.8% |
| ن | 1179066 | 3.2% |
| ع | 1138231 | 3.0% |
| Other values (26) | 8951504 | 23.9% |
Common
| Value | Count | Frequency (%) |
| (space) | 4853698 | 93.7% |
| ( | 152739 | 3.0% |
| ) | 148328 | 2.9% |
| ? | 15912 | 0.3% |
| - | 6164 | 0.1% |
| . | 572 | < 0.1% |
| , | 68 | < 0.1% |
| / | 4 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| Arabic | 37400018 | 87.8% |
| ASCII | 5177485 | 12.2% |
Most frequent character per block
Arabic
| Value | Count | Frequency (%) |
| ا | 8960727 | 24.0% |
| ل | 4631080 | 12.4% |
| ت | 2363159 | 6.3% |
| م | 2287871 | 6.1% |
| و | 2193804 | 5.9% |
| ة | 2064114 | 5.5% |
| ر | 1849187 | 4.9% |
| ي | 1781275 | 4.8% |
| ن | 1179066 | 3.2% |
| ع | 1138231 | 3.0% |
| Other values (26) | 8951504 | 23.9% |
ASCII
| Value | Count | Frequency (%) |
| (space) | 4853698 | 93.7% |
| ( | 152739 | 3.0% |
| ) | 148328 | 2.9% |
| ? | 15912 | 0.3% |
| - | 6164 | 0.1% |
| . | 572 | < 0.1% |
| , | 68 | < 0.1% |
| / | 4 | < 0.1% |
| Distinct | 55 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 88711 |
| Missing (%) | 5.6% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 180.4542314 |
| Minimum | -1 |
| Maximum | 9700800 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 826 |
| Negative (%) | 0.1% |
| Memory size | 12.0 MiB |
Quantile statistics
| Minimum | -1 |
|---|---|
| 5-th percentile | 14 |
| Q1 | 35 |
| median | 45 |
| Q3 | 45 |
| 95-th percentile | 55 |
| Maximum | 9700800 |
| Range | 9700801 |
| Interquartile range (IQR) | 10 |
Descriptive statistics
| Standard deviation | 15856.87138 |
|---|---|
| Coefficient of variation (CV) | 87.87198426 |
| Kurtosis | 190487.524 |
| Mean | 180.4542314 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 338.9212143 |
| Sum | 268033903 |
| Variance | 251440369.9 |
| Monotonicity | Not monotonic |
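The coefficient of variation and the variance in the table above follow directly from the mean and standard deviation. The short check below is a sketch that only re-derives them from the reported figures; the variable names are illustrative.

```python
# Figures copied from the descriptive-statistics table above.
mean = 180.4542314
std = 15856.87138

cv = std / mean        # coefficient of variation = standard deviation / mean
variance = std ** 2    # variance = squared standard deviation

print(f"CV = {cv:.4f}")
print(f"variance = {variance:.1f}")
```

A standard deviation that large against a median of 45 and a maximum of 9700800 suggests the spread is driven by a handful of extreme codes rather than by the bulk of the values.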
| Value | Count | Frequency (%) |
| 45 | 801314 | 50.9% |
| 35 | 371240 | 23.6% |
| 14 | 156902 | 10.0% |
| 55 | 104106 | 6.6% |
| 71 | 22210 | 1.4% |
| 20 | 18131 | 1.2% |
| 70 | 4910 | 0.3% |
| 13 | 2387 | 0.2% |
| 11 | 1684 | 0.1% |
| -1 | 826 | 0.1% |
| Other values (45) | 1619 | 0.1% |
| (Missing) | 88711 | 5.6% |
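The Frequency (%) column is each count divided by the number of observations (1574040), for example 156902 / 1574040 ≈ 10.0%. The sketch below reproduces that arithmetic; `frequency_label` is a hypothetical helper, and its `< 0.1%` / `> 99.9%` edge labels are inferred from how the report prints very small and very large shares, not taken from the profiler's source.

```python
def frequency_label(count, total):
    """Format count/total as a one-decimal percentage with edge labels."""
    pct = 100.0 * count / total
    text = f"{pct:.1f}"
    if count > 0 and text == "0.0":
        return "< 0.1%"      # non-zero share that rounds down to 0.0
    if count < total and text == "100.0":
        return "> 99.9%"     # incomplete share that rounds up to 100.0
    return text + "%"


N_OBSERVATIONS = 1574040  # "Number of observations" from the dataset overview

# (value, count) pairs copied from the table above.
for value, count in [(14, 156902), (55, 104106), (-1, 826)]:
    print(value, frequency_label(count, N_OBSERVATIONS))
```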
| Value | Count | Frequency (%) |
| -1 | 826 | 0.1% |
| 4 | 1 | < 0.1% |
| 10 | 683 | < 0.1% |
| 11 | 1684 | 0.1% |
| 12 | 734 | < 0.1% |
| 13 | 2387 | 0.2% |
| 14 | 156902 | 10.0% |
| 20 | 18131 | 1.2% |
| 35 | 371240 | 23.6% |
| 45 | 801314 | 50.9% |
| Value | Count | Frequency (%) |
| 9700800 | 2 | < 0.1% |
| 979837 | 1 | < 0.1% |
| 979836 | 1 | < 0.1% |
| 979835 | 1 | < 0.1% |
| 979830 | 11 | < 0.1% |
| 979823 | 18 | < 0.1% |
| 979822 | 7 | < 0.1% |
| 979821 | 1 | < 0.1% |
| 979820 | 37 | < 0.1% |
| 979809 | 1 | < 0.1% |
EDUCATION_DESC
Categorical
| Distinct | 12 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 136.8 MiB |
| متوسط | 801314 |
|---|---|
| ثانوية | 371240 |
| جامعى | 144232 |
| ابتدائي | 104106 |
| (blank) | 94649 |
| Other values (7) | 58499 |
Length
| Max length | 35 |
|---|---|
| Median length | 5 |
| Mean length | 5.268034485 |
| Min length | 0 |
Characters and Unicode
| Total characters | 8292097 |
|---|---|
| Distinct characters | 23 |
| Distinct categories | 2 |
| Distinct scripts | 2 |
| Distinct blocks | 2 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | |
|---|---|
| 2nd row | متوسط |
| 3rd row | متوسط |
| 4th row | متوسط |
| 5th row | متوسط |
Common Values
| Value | Count | Frequency (%) |
| متوسط | 801314 | 50.9% |
| ثانوية | 371240 | 23.6% |
| جامعى | 144232 | 9.2% |
| ابتدائي | 104106 | 6.6% |
| (blank) | 94649 | 6.0% |
| خبرة وبدون مؤهل | 22210 | 1.4% |
| دبلوم | 18131 | 1.2% |
| جامعي | 12670 | 0.8% |
| دبلوم دراسات عليا سنة بعد الجامعى | 2387 | 0.2% |
| ماجستير | 1684 | 0.1% |
| Other values (2) | 1417 | 0.1% |
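The table above lists جامعى (144232) and جامعي (12670) as separate values although they differ only in the final letter (alef maqsura versus ya). Their combined count, 156902, equals the count of code 14 in the numeric variable profiled before this one, which suggests they are spelling variants of one category. The sketch below shows one way to fold such variants before counting; the folding rules and the helper names `normalize` and `merge_counts` are assumptions for illustration, not part of the report.

```python
from collections import Counter

# Assumed folding rules (illustrative, not exhaustive):
# map alef maqsura (U+0649) to ya (U+064A) and drop tatweel (U+0640).
FOLD = str.maketrans({"\u0649": "\u064a", "\u0640": None})


def normalize(label):
    """Return a label with common Arabic spelling variants folded together."""
    return label.translate(FOLD).strip()


def merge_counts(counts):
    """Sum the counts of labels that become identical after normalization."""
    merged = Counter()
    for label, count in counts.items():
        merged[normalize(label)] += count
    return merged


# Counts copied from the Common Values table above.
observed = {"جامعى": 144232, "جامعي": 12670}
print(merge_counts(observed))
```

Folding alef maqsura into ya is only safe for a closed set of category labels like this one; applied to free text it can alter words in which the distinction is meaningful.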
Words
| Value | Count | Frequency (%) |
| متوسط | 801314 | 52.1% |
| ثانوية | 371240 | 24.1% |
| جامعى | 144232 | 9.4% |
| ابتدائي | 104106 | 6.8% |
| وبدون | 22210 | 1.4% |
| مؤهل | 22210 | 1.4% |
| خبرة | 22210 | 1.4% |
| دبلوم | 21252 | 1.4% |
| جامعي | 12670 | 0.8% |
| دراسات | 3121 | 0.2% |
| Other values (7) | 14851 | 1.0% |
Most occurring characters
| Value | Count | Frequency (%) |
| و | 1238909 | 14.9% |
| م | 1006483 | 12.1% |
| ت | 911642 | 11.0% |
| س | 809240 | 9.8% |
| ط | 801314 | 9.7% |
| ا | 755060 | 9.1% |
| ي | 492821 | 5.9% |
| ن | 397305 | 4.8% |
| ة | 395837 | 4.8% |
| ث | 371240 | 4.5% |
| Other values (13) | 1112246 | 13.4% |
Most occurring categories
| Value | Count | Frequency (%) |
| Other Letter | 8232072 | 99.3% |
| Space Separator | 60025 | 0.7% |
Most frequent character per category
Other Letter
| Value | Count | Frequency (%) |
| و | 1238909 | 15.0% |
| م | 1006483 | 12.2% |
| ت | 911642 | 11.1% |
| س | 809240 | 9.8% |
| ط | 801314 | 9.7% |
| ا | 755060 | 9.2% |
| ي | 492821 | 6.0% |
| ن | 397305 | 4.8% |
| ة | 395837 | 4.8% |
| ث | 371240 | 4.5% |
| Other values (12) | 1052221 | 12.8% |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 60025 | 100.0% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Arabic | 8232072 | 99.3% |
| Common | 60025 | 0.7% |
Most frequent character per script
Arabic
| Value | Count | Frequency (%) |
| و | 1238909 | 15.0% |
| م | 1006483 | 12.2% |
| ت | 911642 | 11.1% |
| س | 809240 | 9.8% |
| ط | 801314 | 9.7% |
| ا | 755060 | 9.2% |
| ي | 492821 | 6.0% |
| ن | 397305 | 4.8% |
| ة | 395837 | 4.8% |
| ث | 371240 | 4.5% |
| Other values (12) | 1052221 | 12.8% |
Common
| Value | Count | Frequency (%) |
| (space) | 60025 | 100.0% |
Most occurring blocks
| Value | Count | Frequency (%) |
| Arabic | 8232072 | 99.3% |
| ASCII | 60025 | 0.7% |
Most frequent character per block
Arabic
| Value | Count | Frequency (%) |
| و | 1238909 | 15.0% |
| م | 1006483 | 12.2% |
| ت | 911642 | 11.1% |
| س | 809240 | 9.8% |
| ط | 801314 | 9.7% |
| ا | 755060 | 9.2% |
| ي | 492821 | 6.0% |
| ن | 397305 | 4.8% |
| ة | 395837 | 4.8% |
| ث | 371240 | 4.5% |
| Other values (12) | 1052221 | 12.8% |
ASCII
| Value | Count | Frequency (%) |
| (space) | 60025 | 100.0% |
| Distinct | 1421 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 88711 |
| Missing (%) | 5.6% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 462556.0553 |
| Minimum | -1 |
| Maximum | 9700800 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 826 |
| Negative (%) | 0.1% |
| Memory size | 12.0 MiB |
Quantile statistics
| Minimum | -1 |
|---|---|
| 5-th percentile | 55 |
| Q1 | 353002 |
| median | 453501 |
| Q3 | 453501 |
| 95-th percentile | 971509 |
| Maximum | 9700800 |
| Range | 9700801 |
| Interquartile range (IQR) | 100499 |
Descriptive statistics
| Standard deviation | 237409.3255 |
|---|---|
| Coefficient of variation (CV) | 0.5132552537 |
| Kurtosis | 4.141190013 |
| Mean | 462556.0553 |
| Median Absolute Deviation (MAD) | 8 |
| Skewness | 0.7669988393 |
| Sum | 6.87047923 × 10¹¹ |
| Variance | 5.636318783 × 10¹⁰ |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 453501 | 737903 | 46.9% |
| 353002 | 343902 | 21.8% |
| 970080 | 92795 | 5.9% |
| 55 | 73994 | 4.7% |
| 453509 | 38187 | 2.4% |
| 554000 | 30112 | 1.9% |
| 972370 | 27334 | 1.7% |
| 45 | 25224 | 1.6% |
| 14 | 12670 | 0.8% |
| 970296 | 11182 | 0.7% |
| Other values (1411) | 92026 | 5.8% |
| (Missing) | 88711 | 5.6% |
| Value | Count | Frequency (%) |
| -1 | 826 | 0.1% |
| 4 | 1 | < 0.1% |
| 14 | 12670 | 0.8% |
| 20 | 1 | < 0.1% |
| 35 | 4 | < 0.1% |
| 45 | 25224 | 1.6% |
| 55 | 73994 | 4.7% |
| 70 | 4910 | 0.3% |
| 353 | 1 | < 0.1% |
| 2117 | 2 | < 0.1% |
| Value | Count | Frequency (%) |
| 9700800 | 2 | < 0.1% |
| 980401 | 5 | < 0.1% |
| 980312 | 19 | < 0.1% |
| 980308 | 7 | < 0.1% |
| 980305 | 1 | < 0.1% |
| 980296 | 133 | < 0.1% |
| 980295 | 1 | < 0.1% |
| 980293 | 6 | < 0.1% |
| 980292 | 6 | < 0.1% |
| 980273 | 52 | < 0.1% |
SALARY
Real number (ℝ≥0)
| Distinct | 7224 |
|---|---|
| Distinct (%) | 0.5% |
| Missing | 1 |
| Missing (%) | < 0.1% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 316.2937736 |
| Minimum | 1.425 |
| Maximum | 26000 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 12.0 MiB |
Quantile statistics
| Minimum | 1.425 |
|---|---|
| 5-th percentile | 75 |
| Q1 | 100 |
| median | 180 |
| Q3 | 350 |
| 95-th percentile | 1000 |
| Maximum | 26000 |
| Range | 25998.575 |
| Interquartile range (IQR) | 250 |
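The interquartile range above is Q3 minus Q1, that is 350 − 100 = 250. One common rule of thumb for screening outliers, Tukey's fences, builds on it; the report itself does not apply this rule, so the sketch below is only an illustration with a hypothetical helper `tukey_fences` and the conventional multiplier of 1.5.

```python
def tukey_fences(q1, q3, k=1.5):
    """Return (lower, upper) outlier fences computed from the quartiles."""
    iqr = q3 - q1
    return q1 - k * iqr, q3 + k * iqr


# Quartiles copied from the quantile table above.
lower, upper = tukey_fences(100, 350)
print(lower, upper)
```

Because the reported 95-th percentile (1000) already lies above the resulting upper fence, more than 5% of the salaries would be flagged under this rule, which is consistent with the strong right skew reported for this variable.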
Descriptive statistics
| Standard deviation | 446.9902569 |
|---|---|
| Coefficient of variation (CV) | 1.413212318 |
| Kurtosis | 108.3040281 |
| Mean | 316.2937736 |
| Median Absolute Deviation (MAD) | 80 |
| Skewness | 6.978051131 |
| Sum | 497858735.2 |
| Variance | 199800.2898 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 100 | 176864 | 11.2% |
| 250 | 128443 | 8.2% |
| 150 | 126880 | 8.1% |
| 75 | 113054 | 7.2% |
| 80 | 72402 | 4.6% |
| 120 | 70589 | 4.5% |
| 200 | 65688 | 4.2% |
| 300 | 49466 | 3.1% |
| 500 | 43864 | 2.8% |
| 450 | 40260 | 2.6% |
| Other values (7214) | 686529 | 43.6% |
| Value | Count | Frequency (%) |
| 1.425 | 1 | < 0.1% |
| 2 | 76 | < 0.1% |
| 2.1 | 22 | < 0.1% |
| 2.17 | 5 | < 0.1% |
| 2.2 | 2 | < 0.1% |
| 2.267 | 2 | < 0.1% |
| 2.3 | 1 | < 0.1% |
| 2.31 | 1 | < 0.1% |
| 2.33 | 11 | < 0.1% |
| 2.34 | 5 | < 0.1% |
| Value | Count | Frequency (%) |
| 26000 | 1 | < 0.1% |
| 25000 | 1 | < 0.1% |
| 24003.485 | 1 | < 0.1% |
| 22861 | 1 | < 0.1% |
| 20000 | 1 | < 0.1% |
| 17500 | 2 | < 0.1% |
| 15366 | 1 | < 0.1% |
| 15100 | 1 | < 0.1% |
| 15000 | 8 | < 0.1% |
| 14999 | 1 | < 0.1% |
SALARY_TYPE
Categorical
| Distinct | 1 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 1574039 |
| Missing (%) | > 99.9% |
| Memory size | 60.0 MiB |
| 1.0 |
|---|
Length
| Max length | 3 |
|---|---|
| Median length | 3 |
| Mean length | 3 |
| Min length | 3 |
Characters and Unicode
| Total characters | 3 |
|---|---|
| Distinct characters | 3 |
| Distinct categories | 2 |
| Distinct scripts | 1 |
| Distinct blocks | 1 |
Unique
| Unique | 1 |
|---|---|
| Unique (%) | 100.0% |
Sample
| 1st row | 1.0 |
|---|---|
Common Values
| Value | Count | Frequency (%) |
| 1.0 | 1 | < 0.1% |
| (Missing) | 1574039 | > 99.9% |
Words
| Value | Count | Frequency (%) |
| 1.0 | 1 | 100.0% |
Most occurring characters
| Value | Count | Frequency (%) |
| 1 | 1 | 33.3% |
| . | 1 | 33.3% |
| 0 | 1 | 33.3% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 2 | 66.7% |
| Other Punctuation | 1 | 33.3% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 1 | 50.0% |
| 0 | 1 | 50.0% |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 1 | 100.0% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 3 | 100.0% |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 1 | 1 | 33.3% |
| . | 1 | 33.3% |
| 0 | 1 | 33.3% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 3 | 100.0% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 1 | 1 | 33.3% |
| . | 1 | 33.3% |
| 0 | 1 | 33.3% |
| Distinct | 7 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 4.006389291 |
| Minimum | 1 |
| Maximum | 7 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 12.0 MiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 1 |
| Q1 | 2 |
| median | 4 |
| Q3 | 6 |
| 95-th percentile | 7 |
| Maximum | 7 |
| Range | 6 |
| Interquartile range (IQR) | 4 |
Descriptive statistics
| Standard deviation | 2.315616782 |
|---|---|
| Coefficient of variation (CV) | 0.5779809733 |
| Kurtosis | -1.549241881 |
| Mean | 4.006389291 |
| Median Absolute Deviation (MAD) | 2 |
| Skewness | -0.03514552843 |
| Sum | 6306217 |
| Variance | 5.362081081 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 1 | 378777 | 24.1% |
| 7 | 354674 | 22.5% |
| 5 | 221482 | 14.1% |
| 6 | 186621 | 11.9% |
| 2 | 179403 | 11.4% |
| 3 | 153552 | 9.8% |
| 4 | 99531 | 6.3% |
| Value | Count | Frequency (%) |
| 1 | 378777 | 24.1% |
| 2 | 179403 | 11.4% |
| 3 | 153552 | 9.8% |
| 4 | 99531 | 6.3% |
| 5 | 221482 | 14.1% |
| 6 | 186621 | 11.9% |
| 7 | 354674 | 22.5% |
| Value | Count | Frequency (%) |
| 7 | 354674 | 22.5% |
| 6 | 186621 | 11.9% |
| 5 | 221482 | 14.1% |
| 4 | 99531 | 6.3% |
| 3 | 153552 | 9.8% |
| 2 | 179403 | 11.4% |
| 1 | 378777 | 24.1% |
GOVERNORATE_DESC
Categorical
| Distinct | 7 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 167.4 MiB |
| محافظة العاصمة | 378777 |
|---|---|
| العقود الحكومية | 354674 |
| محافظة الفروانية | 221482 |
| محافظة مبارك الكبير | 186621 |
| محافظة حولي | 179403 |
| Other values (2) | 253083 |
Length
| Max length | 19 |
|---|---|
| Median length | 14 |
| Mean length | 14.75762624 |
| Min length | 11 |
Characters and Unicode
| Total characters | 23229094 |
|---|---|
| Distinct characters | 21 |
| Distinct categories | 2 |
| Distinct scripts | 2 |
| Distinct blocks | 2 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | محافظة الفروانية |
|---|---|
| 2nd row | محافظة الفروانية |
| 3rd row | محافظة الفروانية |
| 4th row | محافظة الفروانية |
| 5th row | محافظة الفروانية |
Common Values
| Value | Count | Frequency (%) |
| محافظة العاصمة | 378777 | 24.1% |
| العقود الحكومية | 354674 | 22.5% |
| محافظة الفروانية | 221482 | 14.1% |
| محافظة مبارك الكبير | 186621 | 11.9% |
| محافظة حولي | 179403 | 11.4% |
| محافظة الاحمدي | 153552 | 9.8% |
| محافظة الجهراء | 99531 | 6.3% |
Words
| Value | Count | Frequency (%) |
| محافظة | 1219366 | 36.6% |
| العاصمة | 378777 | 11.4% |
| الحكومية | 354674 | 10.6% |
| العقود | 354674 | 10.6% |
| الفروانية | 221482 | 6.6% |
| مبارك | 186621 | 5.6% |
| الكبير | 186621 | 5.6% |
| حولي | 179403 | 5.4% |
| الاحمدي | 153552 | 4.6% |
| الجهراء | 99531 | 3.0% |
Most occurring characters
| Value | Count | Frequency (%) |
| ا | 4008640 | 17.3% |
| م | 2292990 | 9.9% |
| ة | 2174299 | 9.4% |
| ل | 1928714 | 8.3% |
| ح | 1906995 | 8.2% |
| (space) | 1760661 | 7.6% |
| ف | 1440848 | 6.2% |
| ظ | 1219366 | 5.2% |
| و | 1110233 | 4.8% |
| ي | 1095732 | 4.7% |
| Other values (11) | 4290616 | 18.5% |
Most occurring categories
| Value | Count | Frequency (%) |
| Other Letter | 21468433 | 92.4% |
| Space Separator | 1760661 | 7.6% |
Most frequent character per category
Other Letter
| Value | Count | Frequency (%) |
| ا | 4008640 | 18.7% |
| م | 2292990 | 10.7% |
| ة | 2174299 | 10.1% |
| ل | 1928714 | 9.0% |
| ح | 1906995 | 8.9% |
| ف | 1440848 | 6.7% |
| ظ | 1219366 | 5.7% |
| و | 1110233 | 5.2% |
| ي | 1095732 | 5.1% |
| ع | 733451 | 3.4% |
| Other values (10) | 3557165 | 16.6% |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 1760661 | 100.0% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Arabic | 21468433 | 92.4% |
| Common | 1760661 | 7.6% |
Most frequent character per script
Arabic
| Value | Count | Frequency (%) |
| ا | 4008640 | 18.7% |
| م | 2292990 | 10.7% |
| ة | 2174299 | 10.1% |
| ل | 1928714 | 9.0% |
| ح | 1906995 | 8.9% |
| ف | 1440848 | 6.7% |
| ظ | 1219366 | 5.7% |
| و | 1110233 | 5.2% |
| ي | 1095732 | 5.1% |
| ع | 733451 | 3.4% |
| Other values (10) | 3557165 | 16.6% |
Common
| Value | Count | Frequency (%) |
| (space) | 1760661 | 100.0% |
Most occurring blocks
| Value | Count | Frequency (%) |
| Arabic | 21468433 | 92.4% |
| ASCII | 1760661 | 7.6% |
Most frequent character per block
Arabic
| Value | Count | Frequency (%) |
| ا | 4008640 | 18.7% |
| م | 2292990 | 10.7% |
| ة | 2174299 | 10.1% |
| ل | 1928714 | 9.0% |
| ح | 1906995 | 8.9% |
| ف | 1440848 | 6.7% |
| ظ | 1219366 | 5.7% |
| و | 1110233 | 5.2% |
| ي | 1095732 | 5.1% |
| ع | 733451 | 3.4% |
| Other values (10) | 3557165 | 16.6% |
ASCII
| Value | Count | Frequency (%) |
| (space) | 1760661 | 100.0% |
| Distinct | 9 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 1.836254479 |
| Minimum | 1 |
| Maximum | 11 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 12.0 MiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 1 |
| Q1 | 2 |
| median | 2 |
| Q3 | 2 |
| 95-th percentile | 2 |
| Maximum | 11 |
| Range | 10 |
| Interquartile range (IQR) | 0 |
Descriptive statistics
| Standard deviation | 0.5376797457 |
|---|---|
| Coefficient of variation (CV) | 0.2928133066 |
| Kurtosis | 13.28246702 |
| Mean | 1.836254479 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 1.804203682 |
| Sum | 2890338 |
| Variance | 0.2890995089 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 2 | 1232659 | 78.3% |
| 1 | 319096 | 20.3% |
| 5 | 19116 | 1.2% |
| 3 | 2394 | 0.2% |
| 4 | 764 | < 0.1% |
| 10 | 6 | < 0.1% |
| 11 | 3 | < 0.1% |
| 6 | 1 | < 0.1% |
| 7 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 1 | 319096 | 20.3% |
| 2 | 1232659 | 78.3% |
| 3 | 2394 | 0.2% |
| 4 | 764 | < 0.1% |
| 5 | 19116 | 1.2% |
| 6 | 1 | < 0.1% |
| 7 | 1 | < 0.1% |
| 10 | 6 | < 0.1% |
| 11 | 3 | < 0.1% |
| Value | Count | Frequency (%) |
| 11 | 3 | < 0.1% |
| 10 | 6 | < 0.1% |
| 7 | 1 | < 0.1% |
| 6 | 1 | < 0.1% |
| 5 | 19116 | 1.2% |
| 4 | 764 | < 0.1% |
| 3 | 2394 | 0.2% |
| 2 | 1232659 | 78.3% |
| 1 | 319096 | 20.3% |
MARITAL_STATUS_DESC
Categorical
| Distinct | 6 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 137.0 MiB |
| متزوج | 1232659 |
|---|---|
| أعزب | 319096 |
| (blank) | 15968 |
| غير معرف | 3159 |
| مطلق | 2394 |
Length
| Max length | 8 |
|---|---|
| Median length | 5 |
| Mean length | 4.75056733 |
| Min length | 0 |
Characters and Unicode
| Total characters | 7477583 |
|---|---|
| Distinct characters | 16 |
| Distinct categories | 2 |
| Distinct scripts | 2 |
| Distinct blocks | 2 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | متزوج |
|---|---|
| 2nd row | متزوج |
| 3rd row | أعزب |
| 4th row | متزوج |
| 5th row | متزوج |
Common Values
| Value | Count | Frequency (%) |
| متزوج | 1232659 | 78.3% |
| أعزب | 319096 | 20.3% |
| (blank) | 15968 | 1.0% |
| غير معرف | 3159 | 0.2% |
| مطلق | 2394 | 0.2% |
| أرمل | 764 | < 0.1% |
Words
| Value | Count | Frequency (%) |
| متزوج | 1232659 | 79.0% |
| أعزب | 319096 | 20.4% |
| معرف | 3159 | 0.2% |
| غير | 3159 | 0.2% |
| مطلق | 2394 | 0.2% |
| أرمل | 764 | < 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| ز | 1551755 | 20.8% |
| م | 1238976 | 16.6% |
| ت | 1232659 | 16.5% |
| و | 1232659 | 16.5% |
| ج | 1232659 | 16.5% |
| ع | 322255 | 4.3% |
| أ | 319860 | 4.3% |
| ب | 319096 | 4.3% |
| ر | 7082 | 0.1% |
| غ | 3159 | < 0.1% |
| Other values (6) | 17423 | 0.2% |
Most occurring categories
| Value | Count | Frequency (%) |
| Other Letter | 7474424 | > 99.9% |
| Space Separator | 3159 | < 0.1% |
Most frequent character per category
Other Letter
| Value | Count | Frequency (%) |
| ز | 1551755 | 20.8% |
| م | 1238976 | 16.6% |
| ت | 1232659 | 16.5% |
| و | 1232659 | 16.5% |
| ج | 1232659 | 16.5% |
| ع | 322255 | 4.3% |
| أ | 319860 | 4.3% |
| ب | 319096 | 4.3% |
| ر | 7082 | 0.1% |
| غ | 3159 | < 0.1% |
| Other values (5) | 14264 | 0.2% |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 3159 | 100.0% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Arabic | 7474424 | > 99.9% |
| Common | 3159 | < 0.1% |
Most frequent character per script
Arabic
| Value | Count | Frequency (%) |
| ز | 1551755 | 20.8% |
| م | 1238976 | 16.6% |
| ت | 1232659 | 16.5% |
| و | 1232659 | 16.5% |
| ج | 1232659 | 16.5% |
| ع | 322255 | 4.3% |
| أ | 319860 | 4.3% |
| ب | 319096 | 4.3% |
| ر | 7082 | 0.1% |
| غ | 3159 | < 0.1% |
| Other values (5) | 14264 | 0.2% |
Common
| Value | Count | Frequency (%) |
| (space) | 3159 | 100.0% |
Most occurring blocks
| Value | Count | Frequency (%) |
| Arabic | 7474424 | > 99.9% |
| ASCII | 3159 | < 0.1% |
Most frequent character per block
Arabic
| Value | Count | Frequency (%) |
| ز | 1551755 | 20.8% |
| م | 1238976 | 16.6% |
| ت | 1232659 | 16.5% |
| و | 1232659 | 16.5% |
| ج | 1232659 | 16.5% |
| ع | 322255 | 4.3% |
| أ | 319860 | 4.3% |
| ب | 319096 | 4.3% |
| ر | 7082 | 0.1% |
| غ | 3159 | < 0.1% |
| Other values (5) | 14264 | 0.2% |
ASCII
| Value | Count | Frequency (%) |
| (space) | 3159 | 100.0% |
COMPANY_NAME
Categorical
| Distinct | 122145 |
|---|---|
| Distinct (%) | 7.8% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 247.6 MiB |
| مبنى الركاب الجديد بمطار الكويت (المبنى 11) | 7109 |
|---|---|
| شركة محمد حمود الشايع | 5525 |
| الشركة الاحمدية للمقاولات والتجارة | 5292 |
| (10377) شركة مطاحن الدقيق والمخابز الكويتية | 4613 |
| (ادارة العقودالحكوميه(اعادة قيد | 3934 |
| Other values (122140) | 1547567 |
Length
| Max length | 128 |
|---|---|
| Median length | 37 |
| Mean length | 41.46124431 |
| Min length | 3 |
Characters and Unicode
| Total characters | 65261657 |
|---|---|
| Distinct characters | 97 |
| Distinct categories | 13 |
| Distinct scripts | 4 |
| Distinct blocks | 2 |
Unique
| Unique | 25127 |
|---|---|
| Unique (%) | 1.6% |
Sample
| 1st row | مؤسسة الرهيب الوطنية للتجارة العامة والمقاولات |
|---|---|
| 2nd row | مؤسسة الرهيب الوطنية للتجارة العامة والمقاولات |
| 3rd row | مؤسسة الرهيب الوطنية للتجارة العامة والمقاولات |
| 4th row | مؤسسة الرهيب الوطنية للتجارة العامة والمقاولات |
| 5th row | شركة الفيالق الكويتية للتجارة العامة والمقاولات |
Common Values
| Value | Count | Frequency (%) |
| مبنى الركاب الجديد بمطار الكويت (المبنى 11) | 7109 | 0.5% |
| شركة محمد حمود الشايع | 5525 | 0.4% |
| الشركة الاحمدية للمقاولات والتجارة | 5292 | 0.3% |
| (10377) شركة مطاحن الدقيق والمخابز الكويتية | 4613 | 0.3% |
| (ادارة العقودالحكوميه(اعادة قيد | 3934 | 0.2% |
| مركز التجمع الجديد في جنوب وشرق الكويت GC-32 | 3756 | 0.2% |
| إنشاء وانجاز وصيانة مشروع معسكر الشيخ سالم العلي السالم الصباح | 3627 | 0.2% |
| شركة بدر الملا واخوانه | 3616 | 0.2% |
| مشروع خط انابيب التغذية لشركة نفط الكويت للمصفاة لجديدة NRP | 3590 | 0.2% |
| الشركة الكويتية للاغذية ( الامريكانا ) | 3403 | 0.2% |
| Other values (122135) | 1529575 | 97.2% |
Words
| Value | Count | Frequency (%) |
| شركة | 585643 | 6.1% |
| العامة | 212020 | 2.2% |
| للتجارة | 175881 | 1.8% |
| والمقاولات | 143362 | 1.5% |
| (blank) | 131687 | 1.4% |
| الكويت | 100820 | 1.0% |
| خدمات | 89090 | 0.9% |
| وصيانة | 68642 | 0.7% |
| اعمال | 61724 | 0.6% |
| العامه | 56008 | 0.6% |
| Other values (47598) | 8035801 | 83.2% |
Most occurring characters
| Value | Count | Frequency (%) |
| ا | 9805328 | 15.0% |
| (space) | 8845785 | 13.6% |
| ل | 7537818 | 11.6% |
| ي | 3670610 | 5.6% |
| م | 3588597 | 5.5% |
| و | 3061984 | 4.7% |
| ر | 3002295 | 4.6% |
| ة | 2934440 | 4.5% |
| ت | 2636526 | 4.0% |
| ن | 1980191 | 3.0% |
| Other values (87) | 18198083 | 27.9% |
Most occurring categories
| Value | Count | Frequency (%) |
| Other Letter | 55405898 | 84.9% |
| Space Separator | 8845785 | 13.6% |
| Decimal Number | 442486 | 0.7% |
| Open Punctuation | 143008 | 0.2% |
| Close Punctuation | 126191 | 0.2% |
| Other Punctuation | 117674 | 0.2% |
| Dash Punctuation | 107636 | 0.2% |
| Uppercase Letter | 49204 | 0.1% |
| Modifier Letter | 23150 | < 0.1% |
| Lowercase Letter | 415 | < 0.1% |
| Other values (3) | 210 | < 0.1% |
Most frequent character per category
Other Letter
| Value | Count | Frequency (%) |
| ا | 9805328 | 17.7% |
| ل | 7537818 | 13.6% |
| ي | 3670610 | 6.6% |
| م | 3588597 | 6.5% |
| و | 3061984 | 5.5% |
| ر | 3002295 | 5.4% |
| ة | 2934440 | 5.3% |
| ت | 2636526 | 4.8% |
| ن | 1980191 | 3.6% |
| ع | 1645339 | 3.0% |
| Other values (26) | 15542770 | 28.1% |
Uppercase Letter
| Value | Count | Frequency (%) |
| N | 7089 | 14.4% |
| C | 6631 | 13.5% |
| G | 6543 | 13.3% |
| R | 5483 | 11.1% |
| P | 4954 | 10.1% |
| L | 2911 | 5.9% |
| A | 2521 | 5.1% |
| B | 1793 | 3.6% |
| H | 1632 | 3.3% |
| I | 1524 | 3.1% |
| Other values (13) | 8123 | 16.5% |
Other Punctuation
| Value | Count | Frequency (%) |
| / | 95413 | 81.1% |
| . | 13342 | 11.3% |
| , | 6256 | 5.3% |
| & | 1949 | 1.7% |
| ، | 384 | 0.3% |
| " | 256 | 0.2% |
| \ | 49 | < 0.1% |
| * | 16 | < 0.1% |
| : | 7 | < 0.1% |
| % | 2 | < 0.1% |
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 111811 | 25.3% |
| 0 | 69668 | 15.7% |
| 2 | 59821 | 13.5% |
| 3 | 47456 | 10.7% |
| 7 | 28861 | 6.5% |
| 5 | 28505 | 6.4% |
| 4 | 28474 | 6.4% |
| 8 | 23576 | 5.3% |
| 6 | 23569 | 5.3% |
| 9 | 20745 | 4.7% |
Nonspacing Mark
| Value | Count | Frequency (%) |
| ٌ | 18 | |
| ً | 18 | |
| ُ | 15 | |
| َ | 9 | |
| ِ | 4 | 6.2% |
Math Symbol
| Value | Count | Frequency (%) |
| + | 55 | 93.2% |
| > | 3 | 5.1% |
| \| | 1 | 1.7% |
Lowercase Letter
| Value | Count | Frequency (%) |
| l | 412 | 99.3% |
| x | 2 | 0.5% |
| z | 1 | 0.2% |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 126190 | > 99.9% |
| ] | 1 | < 0.1% |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 8845785 | 100.0% |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 143008 | 100.0% |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 107636 | 100.0% |
Modifier Letter
| Value | Count | Frequency (%) |
| ـ | 23150 |
Connector Punctuation
| Value | Count | Frequency (%) |
| _ | 87 | 100.0% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Arabic | 55405898 | 84.9% |
| Common | 9806076 | 15.0% |
| Latin | 49619 | 0.1% |
| Inherited | 64 | < 0.1% |
Most frequent character per script
Arabic
| Value | Count | Frequency (%) |
| ا | 9805328 | 17.7% |
| ل | 7537818 | 13.6% |
| ي | 3670610 | 6.6% |
| م | 3588597 | 6.5% |
| و | 3061984 | 5.5% |
| ر | 3002295 | 5.4% |
| ة | 2934440 | 5.3% |
| ت | 2636526 | 4.8% |
| ن | 1980191 | 3.6% |
| ع | 1645339 | 3.0% |
| Other values (26) | 15542770 | 28.1% |
Common
| Value | Count | Frequency (%) |
| (space) | 8845785 | 90.2% |
| ( | 143008 | 1.5% |
| ) | 126190 | 1.3% |
| 1 | 111811 | 1.1% |
| - | 107636 | 1.1% |
| / | 95413 | 1.0% |
| 0 | 69668 | 0.7% |
| 2 | 59821 | 0.6% |
| 3 | 47456 | 0.5% |
| 7 | 28861 | 0.3% |
| Other values (20) | 170427 | 1.7% |
Latin
| Value | Count | Frequency (%) |
| N | 7089 | 14.3% |
| C | 6631 | 13.4% |
| G | 6543 | 13.2% |
| R | 5483 | 11.1% |
| P | 4954 | 10.0% |
| L | 2911 | 5.9% |
| A | 2521 | 5.1% |
| B | 1793 | 3.6% |
| H | 1632 | 3.3% |
| I | 1524 | 3.1% |
| Other values (16) | 8538 | 17.2% |
Inherited
| Value | Count | Frequency (%) |
| ٌ | 18 | |
| ً | 18 | |
| ُ | 15 | |
| َ | 9 | |
| ِ | 4 | 6.2% |
Most occurring blocks
| Value | Count | Frequency (%) |
| Arabic | 55429496 | 84.9% |
| ASCII | 9832161 | 15.1% |
Most frequent character per block
Arabic
| Value | Count | Frequency (%) |
| ا | 9805328 | 17.7% |
| ل | 7537818 | 13.6% |
| ي | 3670610 | 6.6% |
| م | 3588597 | 6.5% |
| و | 3061984 | 5.5% |
| ر | 3002295 | 5.4% |
| ة | 2934440 | 5.3% |
| ت | 2636526 | 4.8% |
| ن | 1980191 | 3.6% |
| ع | 1645339 | 3.0% |
| Other values (33) | 15566368 | 28.1% |
ASCII
| Value | Count | Frequency (%) |
| (space) | 8845785 | 90.0% |
| ( | 143008 | 1.5% |
| ) | 126190 | 1.3% |
| 1 | 111811 | 1.1% |
| - | 107636 | 1.1% |
| / | 95413 | 1.0% |
| 0 | 69668 | 0.7% |
| 2 | 59821 | 0.6% |
| 3 | 47456 | 0.5% |
| 7 | 28861 | 0.3% |
| Other values (44) | 196512 | 2.0% |
HIRE_DATE
Date
| Distinct | 4826 |
|---|---|
| Distinct (%) | 0.3% |
| Missing | 42 |
| Missing (%) | < 0.1% |
| Memory size | 12.0 MiB |
| Minimum | 1966-12-12 00:00:00 |
| Maximum | 2020-12-31 00:00:00 |
| Distinct | 112316 |
|---|---|
| Distinct (%) | 9.2% |
| Missing | 354729 |
| Missing (%) | 22.5% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 16104080.23 |
| Minimum | 0 |
| Maximum | 99999999 |
| Zeros | 1189 |
| Zeros (%) | 0.1% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 12.0 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 10222598 |
| Q1 | 13214876 |
| median | 16388361 |
| Q3 | 18894778 |
| 95-th percentile | 20882688 |
| Maximum | 99999999 |
| Range | 99999999 |
| Interquartile range (IQR) | 5679902 |
Descriptive statistics
| Standard deviation | 4275012.298 |
|---|---|
| Coefficient of variation (CV) | 0.2654614381 |
| Kurtosis | 134.9536085 |
| Mean | 16104080.23 |
| Median Absolute Deviation (MAD) | 2654096 |
| Skewness | 6.752699145 |
| Sum | 1.963588217 × 10¹³ |
| Variance | 1.827573015 × 10¹³ |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 20679196 | 5507 | 0.3% |
| 10555878 | 5292 | 0.3% |
| 10595474 | 4613 | 0.3% |
| 20774206 | 3470 | 0.2% |
| 10615834 | 3401 | 0.2% |
| 21107704 | 2949 | 0.2% |
| 10244973 | 2633 | 0.2% |
| 20243296 | 2451 | 0.2% |
| 10310395 | 2368 | 0.2% |
| 10081625 | 2245 | 0.1% |
| Other values (112306) | 1184382 | 75.2% |
| (Missing) | 354729 | 22.5% |
| Value | Count | Frequency (%) |
| 0 | 1189 | 0.1% |
| 10000012 | 214 | < 0.1% |
| 10000143 | 31 | < 0.1% |
| 10000151 | 1 | < 0.1% |
| 10000186 | 11 | < 0.1% |
| 10000194 | 16 | < 0.1% |
| 10000354 | 5 | < 0.1% |
| 10000688 | 1 | < 0.1% |
| 10001242 | 4 | < 0.1% |
| 10001293 | 24 | < 0.1% |
| Value | Count | Frequency (%) |
| 99999999 | 1126 | 0.1% |
| 21482686 | 1 | < 0.1% |
| 21474942 | 14 | < 0.1% |
| 21473341 | 8 | < 0.1% |
| 21472568 | 2 | < 0.1% |
| 21471872 | 2 | < 0.1% |
| 21471717 | 1 | < 0.1% |
| 21471709 | 12 | < 0.1% |
| 21470968 | 1 | < 0.1% |
| 21469641 | 4 | < 0.1% |
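In the extreme-value tables above, 0 occurs 1189 times and 99999999 occurs 1126 times, while every other listed value lies between 10000012 and 21482686. That pattern suggests the two codes are placeholders rather than real identifiers, although the report does not say so. Under that assumption, the sketch below shows how such codes could be masked before computing statistics; `SENTINELS` and `mask_sentinels` are hypothetical names.

```python
# Assumed placeholder codes, inferred from the extreme-value tables above.
SENTINELS = {0, 99999999}


def mask_sentinels(values, sentinels=SENTINELS):
    """Replace sentinel codes with None so they are treated as missing."""
    return [None if v in sentinels else v for v in values]


# A few values taken from the tables above.
sample = [0, 10000012, 21482686, 99999999]
cleaned = mask_sentinels(sample)
print(cleaned)

# Statistics are then computed on the remaining values only.
valid = [v for v in cleaned if v is not None]
print(min(valid), max(valid))
```

With the placeholders removed, the maximum would drop from 99999999 to 21482686, which would also reduce the reported skewness and kurtosis.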
| Distinct | 83888 |
|---|---|
| Distinct (%) | 5.3% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 2.263286075 × 10¹¹ |
| Minimum | 18200000 |
| Maximum | 9.99779 × 10¹¹ |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 12.0 MiB |
Quantile statistics
| Minimum | 18200000 |
|---|---|
| 5-th percentile | 1.0202 × 10¹⁰ |
| Q1 | 1.276715 × 10¹¹ |
| median | 2.133913 × 10¹¹ |
| Q3 | 2.80479 × 10¹¹ |
| 95-th percentile | 5.97241 × 10¹¹ |
| Maximum | 9.99779 × 10¹¹ |
| Range | 9.997608 × 10¹¹ |
| Interquartile range (IQR) | 1.528075 × 10¹¹ |
Descriptive statistics
| Standard deviation | 1.640134387 × 10¹¹ |
|---|---|
| Coefficient of variation (CV) | 0.7246694992 |
| Kurtosis | 5.118960536 |
| Mean | 2.263286075 × 10¹¹ |
| Median Absolute Deviation (MAD) | 7.868980069 × 10¹⁰ |
| Skewness | 1.863400769 |
| Sum | 3.562502814 × 10¹⁷ |
| Variance | 2.690040806 × 10²² |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 1.276715 × 10¹¹ | 10922 | 0.7% |
| 2.53696 × 10¹¹ | 5292 | 0.3% |
| 1.05998 × 10¹¹ | 5062 | 0.3% |
| 2201600033 | 4750 | 0.3% |
| 1.523245 × 10¹¹ | 4613 | 0.3% |
| 3.67998 × 10¹¹ | 4037 | 0.3% |
| 7.777777778 × 10¹¹ | 3934 | 0.2% |
| 5.86047 × 10¹¹ | 3656 | 0.2% |
| 1.42019 × 10¹⁰ | 3627 | 0.2% |
| 2.24213 × 10¹¹ | 3462 | 0.2% |
| Other values (83878) | 1524685 | 96.9% |
| Value | Count | Frequency (%) |
| 18200000 | 80 | < 0.1% |
| 130120000 | 3 | < 0.1% |
| 201600001 | 38 | < 0.1% |
| 1201400003 | 1 | < 0.1% |
| 1201400009 | 989 | 0.1% |
| 1201500006 | 2 | < 0.1% |
| 1201500007 | 96 | < 0.1% |
| 1201500017 | 9 | < 0.1% |
| 1201500020 | 2 | < 0.1% |
| 1201600002 | 340 | < 0.1% |
| Value | Count | Frequency (%) |
| 9.99779 × 10¹¹ | 8 | < 0.1% |
| 9.9901 × 10¹¹ | 4 | < 0.1% |
| 9.9712 × 10¹¹ | 40 | < 0.1% |
| 9.96836 × 10¹¹ | 4 | < 0.1% |
| 9.9637 × 10¹¹ | 105 | < 0.1% |
| 9.96326 × 10¹¹ | 7 | < 0.1% |
| 9.9632 × 10¹¹ | 17 | < 0.1% |
| 9.9621 × 10¹¹ | 85 | < 0.1% |
| 9.96111 × 10¹¹ | 7 | < 0.1% |
| 9.95898 × 10¹¹ | 19 | < 0.1% |
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 90.1 MiB |
| 2.0 | 1502732 |
|---|---|
| 1.0 | 71308 |
Length
| Max length | 3 |
|---|---|
| Median length | 3 |
| Mean length | 3 |
| Min length | 3 |
Characters and Unicode
| Total characters | 4722120 |
|---|---|
| Distinct characters | 4 |
| Distinct categories | 2 |
| Distinct scripts | 1 |
| Distinct blocks | 1 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 1.0 |
|---|---|
| 2nd row | 2.0 |
| 3rd row | 2.0 |
| 4th row | 2.0 |
| 5th row | 2.0 |
Common Values
| Value | Count | Frequency (%) |
| 2.0 | 1502732 | 95.5% |
| 1.0 | 71308 | 4.5% |
Words
| Value | Count | Frequency (%) |
| 2.0 | 1502732 | 95.5% |
| 1.0 | 71308 | 4.5% |
Most occurring characters
| Value | Count | Frequency (%) |
| . | 1574040 | 33.3% |
| 0 | 1574040 | 33.3% |
| 2 | 1502732 | 31.8% |
| 1 | 71308 | 1.5% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 3148080 | 66.7% |
| Other Punctuation | 1574040 | 33.3% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 1574040 | 50.0% |
| 2 | 1502732 | 47.7% |
| 1 | 71308 | 2.3% |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 1574040 | 100.0% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 4722120 | 100.0% |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| . | 1574040 | 33.3% |
| 0 | 1574040 | 33.3% |
| 2 | 1502732 | 31.8% |
| 1 | 71308 | 1.5% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 4722120 | 100.0% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| . | 1574040 | 33.3% |
| 0 | 1574040 | 33.3% |
| 2 | 1502732 | 31.8% |
| 1 | 71308 | 1.5% |
| Distinct | 97 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 500 |
| Missing (%) | < 0.1% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 40.54873786 |
| Minimum | -27.6 |
| Maximum | 143.39 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 25 |
| Negative (%) | < 0.1% |
| Memory size | 12.0 MiB |
Quantile statistics
| Minimum | -27.6 |
|---|---|
| 5-th percentile | 26.4 |
| Q1 | 32.4 |
| median | 39.4 |
| Q3 | 47.4 |
| 95-th percentile | 59.4 |
| Maximum | 143.39 |
| Range | 170.99 |
| Interquartile range (IQR) | 15 |
Descriptive statistics
| Standard deviation | 10.38886196 |
|---|---|
| Coefficient of variation (CV) | 0.2562067898 |
| Kurtosis | 0.1404874705 |
| Mean | 40.54873786 |
| Median Absolute Deviation (MAD) | 7 |
| Skewness | 0.6786459787 |
| Sum | 63805060.98 |
| Variance | 107.9284528 |
| Monotonicity | Not monotonic |
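
Several of the statistics above are derived from others, so they can be cross-checked with simple arithmetic. The sketch below copies the reported values into plain Python variables (the variable names are chosen here for illustration) and recomputes the range, interquartile range, coefficient of variation, variance, and sum. The sum check assumes the mean is taken over the non-missing observations only.

```python
# Values copied from the report tables for this variable.
minimum, maximum = -27.6, 143.39
q1, q3 = 32.4, 47.4
mean, std = 40.54873786, 10.38886196
observations, missing = 1574040, 500

value_range = maximum - minimum          # Range
iqr = q3 - q1                            # Interquartile range (IQR)
cv = std / mean                          # Coefficient of variation (CV)
variance = std ** 2                      # Variance
total = mean * (observations - missing)  # Sum over non-missing values

print(f"range={value_range:.2f} iqr={iqr:.1f} cv={cv:.6f}")
print(f"variance={variance:.4f} sum={total:.2f}")
```

Small differences in the last digits are expected, because the report itself shows rounded values.
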
Common values
| Value | Count | Frequency (%) |
|---|---|---|
| 33.4 | 65641 | 4.2% |
| 34.4 | 59774 | 3.8% |
| 30.4 | 58931 | 3.7% |
| 36.4 | 58435 | 3.7% |
| 35.4 | 58112 | 3.7% |
| 37.4 | 57373 | 3.6% |
| 31.4 | 57270 | 3.6% |
| 39.4 | 56847 | 3.6% |
| 29.4 | 56816 | 3.6% |
| 38.4 | 56737 | 3.6% |
| Other values (87) | 987604 | 62.7% |
Minimum 10 values
| Value | Count | Frequency (%) |
|---|---|---|
| -27.6 | 7 | < 0.1% |
| -26.6 | 2 | < 0.1% |
| -25.6 | 4 | < 0.1% |
| -24.6 | 2 | < 0.1% |
| -23.6 | 1 | < 0.1% |
| -22.6 | 3 | < 0.1% |
| -21.6 | 1 | < 0.1% |
| -20.6 | 1 | < 0.1% |
| -18.6 | 1 | < 0.1% |
| -16.6 | 1 | < 0.1% |
Maximum 10 values
| Value | Count | Frequency (%) |
|---|---|---|
| 143.39 | 1 | < 0.1% |
| 132.39 | 1 | < 0.1% |
| 113.4 | 1 | < 0.1% |
| 99.4 | 3 | < 0.1% |
| 98.4 | 2 | < 0.1% |
| 96.4 | 2 | < 0.1% |
| 95.4 | 3 | < 0.1% |
| 94.4 | 4 | < 0.1% |
| 93.4 | 7 | < 0.1% |
| 92.4 | 6 | < 0.1% |
| Distinct | 8 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 587 |
| Missing (%) | < 0.1% |
| Memory size | 1.5 MiB |
| 40-49 | 585564 |
|---|---|
| 50-59 | 440295 |
| 30-39 | 248027 |
| 60-69 | 222444 |
| 70-79 | 65907 |
| Other values (3) | 11216 |
Length
| Max length | 16 |
|---|---|
| Median length | 5 |
| Mean length | 5.00480599 |
| Min length | 5 |
Characters and Unicode
| Total characters | 7874827 |
|---|---|
| Distinct characters | 20 |
| Distinct categories | 5 |
| Distinct scripts | 2 |
| Distinct blocks | 1 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 60-69 |
|---|---|
| 2nd row | 50-59 |
| 3rd row | 50-59 |
| 4th row | 60-69 |
| 5th row | 30-39 |
Common Values
| Value | Count | Frequency (%) |
|---|---|---|
| 40-49 | 585564 | 37.2% |
| 50-59 | 440295 | 28.0% |
| 30-39 | 248027 | 15.8% |
| 60-69 | 222444 | 14.1% |
| 70-79 | 65907 | 4.2% |
| 80-89 | 9989 | 0.6% |
| Not Defined | 1187 | 0.1% |
| Not Defined20-29 | 40 | < 0.1% |
| (Missing) | 587 | < 0.1% |
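
The Frequency (%) column is each count divided by the total number of observations. The sketch below recomputes it from the counts in the table above. The helper `frequency_label` and its rounding rule (round to one decimal place, then show `< 0.1%` when the result rounds to zero) are assumptions chosen to match the values displayed here, not a documented rule of the profiling tool.

```python
# Counts copied from the Common Values table for this variable.
counts = {
    "40-49": 585564,
    "50-59": 440295,
    "30-39": 248027,
    "60-69": 222444,
    "70-79": 65907,
    "80-89": 9989,
    "Not Defined": 1187,
    "Not Defined20-29": 40,
    "(Missing)": 587,
}

# The counts, including missing values, add up to the number of observations.
total = sum(counts.values())


def frequency_label(count, total):
    """Format a count as a percentage label (assumed rounding rule)."""
    pct = round(100 * count / total, 1)
    return "< 0.1%" if pct < 0.1 else f"{pct:.1f}%"


for value, count in counts.items():
    print(f"{value}: {count} ({frequency_label(count, total)})")
```
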
Words
| Value | Count | Frequency (%) |
|---|---|---|
| 40-49 | 585564 | 37.2% |
| 50-59 | 440295 | 28.0% |
| 30-39 | 248027 | 15.8% |
| 60-69 | 222444 | 14.1% |
| 70-79 | 65907 | 4.2% |
| 80-89 | 9989 | 0.6% |
| not | 1227 | 0.1% |
| defined | 1187 | 0.1% |
| defined20-29 | 40 | < 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
|---|---|---|
| 0 | 1572266 | 20.0% |
| - | 1572266 | 20.0% |
| 9 | 1572266 | 20.0% |
| 4 | 1171128 | 14.9% |
| 5 | 880590 | 11.2% |
| 3 | 496054 | 6.3% |
| 6 | 444888 | 5.6% |
| 7 | 131814 | 1.7% |
| 8 | 19978 | 0.3% |
| e | 2454 | < 0.1% |
| Other values (10) | 11123 | 0.1% |
Most occurring categories
| Value | Count | Frequency (%) |
|---|---|---|
| Decimal Number | 6289064 | 79.9% |
| Dash Punctuation | 1572266 | 20.0% |
| Lowercase Letter | 9816 | 0.1% |
| Uppercase Letter | 2454 | < 0.1% |
| Space Separator | 1227 | < 0.1% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
|---|---|---|
| 0 | 1572266 | 25.0% |
| 9 | 1572266 | 25.0% |
| 4 | 1171128 | 18.6% |
| 5 | 880590 | 14.0% |
| 3 | 496054 | 7.9% |
| 6 | 444888 | 7.1% |
| 7 | 131814 | 2.1% |
| 8 | 19978 | 0.3% |
| 2 | 80 | < 0.1% |
Lowercase Letter
| Value | Count | Frequency (%) |
|---|---|---|
| e | 2454 | 25.0% |
| o | 1227 | 12.5% |
| t | 1227 | 12.5% |
| f | 1227 | 12.5% |
| i | 1227 | 12.5% |
| n | 1227 | 12.5% |
| d | 1227 | 12.5% |
Uppercase Letter
| Value | Count | Frequency (%) |
|---|---|---|
| N | 1227 | 50.0% |
| D | 1227 | 50.0% |
Dash Punctuation
| Value | Count | Frequency (%) |
|---|---|---|
| - | 1572266 | 100.0% |
Space Separator
| Value | Count | Frequency (%) |
|---|---|---|
|   | 1227 | 100.0% |
Most occurring scripts
| Value | Count | Frequency (%) |
|---|---|---|
| Common | 7862557 | 99.8% |
| Latin | 12270 | 0.2% |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
|---|---|---|
| 0 | 1572266 | 20.0% |
| - | 1572266 | 20.0% |
| 9 | 1572266 | 20.0% |
| 4 | 1171128 | 14.9% |
| 5 | 880590 | 11.2% |
| 3 | 496054 | 6.3% |
| 6 | 444888 | 5.7% |
| 7 | 131814 | 1.7% |
| 8 | 19978 | 0.3% |
|   | 1227 | < 0.1% |
Latin
| Value | Count | Frequency (%) |
|---|---|---|
| e | 2454 | 20.0% |
| N | 1227 | 10.0% |
| o | 1227 | 10.0% |
| t | 1227 | 10.0% |
| D | 1227 | 10.0% |
| f | 1227 | 10.0% |
| i | 1227 | 10.0% |
| n | 1227 | 10.0% |
| d | 1227 | 10.0% |
Most occurring blocks
| Value | Count | Frequency (%) |
|---|---|---|
| ASCII | 7874827 | 100.0% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
|---|---|---|
| 0 | 1572266 | 20.0% |
| - | 1572266 | 20.0% |
| 9 | 1572266 | 20.0% |
| 4 | 1171128 | 14.9% |
| 5 | 880590 | 11.2% |
| 3 | 496054 | 6.3% |
| 6 | 444888 | 5.6% |
| 7 | 131814 | 1.7% |
| 8 | 19978 | 0.3% |
| e | 2454 | < 0.1% |
| Other values (10) | 11123 | 0.1% |
Pearson's r
Pearson's correlation coefficient (r) is a measure of linear correlation between two variables. Its value lies between -1 and +1: -1 indicates total negative linear correlation, 0 indicates no linear correlation, and +1 indicates total positive linear correlation. Furthermore, r is invariant under separate changes in location and (positive) scale of the two variables, so for an exactly linear relationship the steepness of the line (its angle to the x-axis) does not affect the magnitude of r. To calculate r for two variables X and Y, one divides the covariance of X and Y by the product of their standard deviations.
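
As a minimal sketch of the calculation just described, the following pure-Python function divides the covariance by the product of the standard deviations. The function name `pearson_r` and the toy data are chosen here for illustration; in practice a library routine would normally be used.

```python
import math


def pearson_r(xs, ys):
    """Covariance of xs and ys divided by the product of their standard deviations."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    # The 1/n factors of the covariance and the two variances cancel,
    # so plain sums of products are sufficient.
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    var_x = sum((x - mean_x) ** 2 for x in xs)
    var_y = sum((y - mean_y) ** 2 for y in ys)
    return cov / math.sqrt(var_x * var_y)


# Toy data (hypothetical): a noisy increasing relationship.
xs = [1.0, 2.0, 3.0, 4.0, 5.0]
ys = [2.1, 3.9, 6.2, 7.8, 10.1]

r = pearson_r(xs, ys)
# Shifting and positively rescaling one variable leaves r unchanged.
r_rescaled = pearson_r(xs, [3.0 * y + 7.0 for y in ys])
print(r, r_rescaled)
```
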
Spearman's ρ
Spearman's rank correlation coefficient (ρ) is a measure of monotonic correlation between two variables, and is therefore better at catching nonlinear monotonic correlations than Pearson's r. Its value lies between -1 and +1: -1 indicates total negative monotonic correlation, 0 indicates no monotonic correlation, and +1 indicates total positive monotonic correlation. To calculate ρ for two variables X and Y, one divides the covariance of the rank variables of X and Y by the product of their standard deviations.
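
The description above amounts to applying Pearson's formula to ranks. The sketch below does exactly that in pure Python, assigning tied values the average of their positions (a common convention, assumed here). The names `average_ranks` and `spearman_rho` and the toy data are illustrative only.

```python
import math


def average_ranks(values):
    """Return 1-based ranks; tied values share the average of their positions."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    ranks = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        # Extend j to cover the run of values equal to values[order[i]].
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        avg = (i + j) / 2 + 1  # mean of the 1-based positions i+1 .. j+1
        for k in range(i, j + 1):
            ranks[order[k]] = avg
        i = j + 1
    return ranks


def spearman_rho(xs, ys):
    """Pearson correlation computed on the rank variables of xs and ys."""
    rx, ry = average_ranks(xs), average_ranks(ys)
    n = len(rx)
    mean_x, mean_y = sum(rx) / n, sum(ry) / n
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(rx, ry))
    var_x = sum((a - mean_x) ** 2 for a in rx)
    var_y = sum((b - mean_y) ** 2 for b in ry)
    return cov / math.sqrt(var_x * var_y)


# Toy data (hypothetical): a cubic relationship is nonlinear but monotonic.
xs = [1, 2, 3, 4, 5]
ys = [x ** 3 for x in xs]
rho = spearman_rho(xs, ys)
print(rho)
```

Because only the ordering of the values matters, the cubic relationship in the toy data is treated the same as a straight line.
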
Kendall's τ
Similarly to Spearman's rank correlation coefficient, the Kendall rank correlation coefficient (τ) measures ordinal association between two variables. Its value lies between -1 and +1: -1 indicates total negative correlation, 0 indicates no correlation, and +1 indicates total positive correlation. To calculate τ for two variables X and Y, one determines the number of concordant and discordant pairs of observations. τ is then the number of concordant pairs minus the number of discordant pairs, divided by the total number of pairs.
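
The pair-counting definition above can be sketched directly in pure Python. This version is the simple form often called tau-a, which divides by the total number of pairs and makes no adjustment for ties; statistical libraries commonly default to a tie-adjusted variant, so results can differ on data with many ties. The function name `kendall_tau_a` and the toy data are illustrative.

```python
from itertools import combinations


def kendall_tau_a(xs, ys):
    """(concordant - discordant) / total number of pairs, without tie correction."""
    concordant = 0
    discordant = 0
    pairs = list(combinations(range(len(xs)), 2))
    for i, j in pairs:
        # The sign of the product tells whether both variables move the same way.
        s = (xs[i] - xs[j]) * (ys[i] - ys[j])
        if s > 0:
            concordant += 1
        elif s < 0:
            discordant += 1
        # s == 0 means a tie in at least one variable: counted as neither.
    return (concordant - discordant) / len(pairs)


# Toy data (hypothetical): two concordant pairs and one discordant pair.
tau = kendall_tau_a([1, 2, 3], [1, 3, 2])
print(tau)
```
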
Phik (φk)
Phik (φk) is a new and practical correlation coefficient that works consistently between categorical, ordinal and interval variables, captures non-linear dependency, and reverts to the Pearson correlation coefficient in the case of a bivariate normal input distribution. There is extensive documentation available here.
Cramér's V (φc)
Cramér's V is an association measure for nominal random variables. The coefficient ranges from 0 to 1, with 0 indicating independence and 1 indicating perfect association. The empirical estimators used for Cramér's V have been shown to be biased, even for large samples. We use a bias-corrected measure, proposed by Bergsma in 2013, that can be found here.
First rows
| | CIVIL_ID | BIRTH_DATE | COUNTRY_CODE | COUNTRY_DESC | GENDER_CODE | GENDER_DESC | RLGION_CODE | RLGION_DESC | JOB_CODE | JOB_DESC | SECTOR | ECONOMIC_ACT_CODE | ECONOMIC_ACT_DESC | EDUCATION_CODE | EDUCATION_DESC | MAJOR_CODE | SALARY | SALARY_TYPE | ONR_GVRN_CODE | GOVERNORATE_DESC | MARITAL_STATUS_CODE | MARITAL_STATUS_DESC | COMPANY_NAME | HIRE_DATE | ADDRESS_AUTO_NO | ONR_ID | جنسية | Age | Age Group |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 0 | 271121201014 | 1971-01-01 | 101.0 | الكويت | 2.0 | انثى | NaN | NaN | 43390 | مسئول | PRIVATE | 61244.0 | التجارة العامة و المقاولات | NaN | NaN | NaN | 200.0 | NaN | 5 | محافظة الفروانية | 2 | متزوج | مؤسسة الرهيب الوطنية للتجارة العامة والمقاولات | 2020-02-18 | 13096531.0 | 2.711212e+11 | 1.0 | 50.4 | 60-69 |
| 1 | 276032503881 | 1976-01-01 | 721.0 | باكستان | 1.0 | ذكر | 1.0 | مسلم | 83190 | حداد | PRIVATE | 61244.0 | التجارة العامة و المقاولات | 45.0 | متوسط | 453501.0 | 80.0 | NaN | 5 | محافظة الفروانية | 2 | متزوج | مؤسسة الرهيب الوطنية للتجارة العامة والمقاولات | 2008-04-15 | 13096531.0 | 2.711212e+11 | 2.0 | 45.4 | 50-59 |
| 2 | 276121004718 | 1976-01-01 | 107.0 | مصـــر | 1.0 | ذكر | 1.0 | مسلم | 94985 | نقاش | PRIVATE | 61244.0 | التجارة العامة و المقاولات | 45.0 | متوسط | 453501.0 | 100.0 | NaN | 5 | محافظة الفروانية | 1 | أعزب | مؤسسة الرهيب الوطنية للتجارة العامة والمقاولات | 2009-04-06 | 13096531.0 | 2.711212e+11 | 2.0 | 45.4 | 50-59 |
| 3 | 270050801914 | 1970-01-01 | 107.0 | مصـــر | 1.0 | ذكر | 1.0 | مسلم | 94985 | نقاش | PRIVATE | 61244.0 | التجارة العامة و المقاولات | 45.0 | متوسط | 453501.0 | 300.0 | NaN | 5 | محافظة الفروانية | 2 | متزوج | مؤسسة الرهيب الوطنية للتجارة العامة والمقاولات | 2010-02-16 | 13096531.0 | 2.711212e+11 | 2.0 | 51.4 | 60-69 |
| 4 | 294061303976 | 1994-01-01 | 107.0 | مصـــر | 1.0 | ذكر | 1.0 | مسلم | 3560 | فنى كهربائي | PRIVATE | 51204.0 | مقاولات انشاءات كهربائية وميكانيكية مثل محطات توليد الكهرباء | 45.0 | متوسط | 453501.0 | 150.0 | NaN | 5 | محافظة الفروانية | 2 | متزوج | شركة الفيالق الكويتية للتجارة العامة والمقاولات | 2019-06-30 | 13094464.0 | 2.186546e+11 | 2.0 | 27.4 | 30-39 |
| 5 | 281010120878 | 1981-01-01 | 701.0 | افغانستان | 1.0 | ذكر | 1.0 | مسلم | 3560 | فنى كهربائي | PRIVATE | 51204.0 | مقاولات انشاءات كهربائية وميكانيكية مثل محطات توليد الكهرباء | 45.0 | متوسط | 453501.0 | 100.0 | NaN | 5 | محافظة الفروانية | 2 | متزوج | شركة الفيالق الكويتية للتجارة العامة والمقاولات | 2008-09-21 | 13094464.0 | 2.186546e+11 | 2.0 | 40.4 | 50-59 |
| 6 | 277072009153 | 1977-01-01 | 107.0 | مصـــر | 1.0 | ذكر | 1.0 | مسلم | 3560 | فنى كهربائي | PRIVATE | 51204.0 | مقاولات انشاءات كهربائية وميكانيكية مثل محطات توليد الكهرباء | 45.0 | متوسط | 453501.0 | 150.0 | NaN | 5 | محافظة الفروانية | 2 | متزوج | شركة الفيالق الكويتية للتجارة العامة والمقاولات | 2018-12-04 | 13094464.0 | 2.186546e+11 | 2.0 | 44.4 | 50-59 |
| 7 | 292102004229 | 1992-01-01 | 107.0 | مصـــر | 1.0 | ذكر | 1.0 | مسلم | 3560 | فنى كهربائي | PRIVATE | 51204.0 | مقاولات انشاءات كهربائية وميكانيكية مثل محطات توليد الكهرباء | 35.0 | ثانوية | 972370.0 | 150.0 | NaN | 5 | محافظة الفروانية | 1 | أعزب | شركة الفيالق الكويتية للتجارة العامة والمقاولات | 2017-02-27 | 13094464.0 | 2.186546e+11 | 2.0 | 29.4 | 30-39 |
| 8 | 289062303429 | 1989-01-01 | 709.0 | الهنــد | 1.0 | ذكر | 0.0 | ديانات أخري | 37010 | موزع | PRIVATE | 51204.0 | مقاولات انشاءات كهربائية وميكانيكية مثل محطات توليد الكهرباء | 45.0 | متوسط | 453501.0 | 120.0 | NaN | 5 | محافظة الفروانية | 2 | متزوج | شركة الفيالق الكويتية للتجارة العامة والمقاولات | 2013-10-21 | 13094464.0 | 2.186546e+11 | 2.0 | 32.4 | 40-49 |
| 9 | 268040103853 | 1968-01-01 | 110.0 | ســوريا | 1.0 | ذكر | 1.0 | مسلم | 37010 | موزع | PRIVATE | 51204.0 | مقاولات انشاءات كهربائية وميكانيكية مثل محطات توليد الكهرباء | 45.0 | متوسط | 453501.0 | 450.0 | NaN | 5 | محافظة الفروانية | 2 | متزوج | شركة الفيالق الكويتية للتجارة العامة والمقاولات | 2009-01-13 | 13094464.0 | 2.186546e+11 | 2.0 | 53.4 | 60-69 |
Last rows
| | CIVIL_ID | BIRTH_DATE | COUNTRY_CODE | COUNTRY_DESC | GENDER_CODE | GENDER_DESC | RLGION_CODE | RLGION_DESC | JOB_CODE | JOB_DESC | SECTOR | ECONOMIC_ACT_CODE | ECONOMIC_ACT_DESC | EDUCATION_CODE | EDUCATION_DESC | MAJOR_CODE | SALARY | SALARY_TYPE | ONR_GVRN_CODE | GOVERNORATE_DESC | MARITAL_STATUS_CODE | MARITAL_STATUS_DESC | COMPANY_NAME | HIRE_DATE | ADDRESS_AUTO_NO | ONR_ID | جنسية | Age | Age Group |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 1574030 | 288081002926 | 1988-01-01 | 107.0 | مصـــر | 1.0 | ذكر | 1.0 | مسلم | 99320 | عامل عادى خفيف | PRIVATE | 83102.0 | شراء وبيع الاراضي والعقارات وتقسيمها | 45.0 | متوسط | 453501.0 | 300.0 | NaN | 5 | محافظة الفروانية | 1 | أعزب | شركة الجماعه العقاريه | 2009-07-09 | 12468196.0 | 1.520047e+11 | 2.0 | 33.4 | 40-49 |
| 1574031 | 290090203589 | 1990-01-01 | 107.0 | مصـــر | 1.0 | ذكر | 1.0 | مسلم | 45290 | بائع | PRIVATE | 62159.0 | الاسواق المركزية | 45.0 | متوسط | 453501.0 | 200.0 | NaN | 1 | محافظة العاصمة | 2 | متزوج | شركة سوق المدينة الفلسطيني المركزي | 2019-05-01 | 19138157.0 | 1.120048e+11 | 2.0 | 31.4 | 40-49 |
| 1574032 | 258111500428 | 1958-01-01 | 702.0 | بنجلاديش | 1.0 | ذكر | 1.0 | مسلم | 45290 | بائع | PRIVATE | 61172.0 | تجارة مستحضرات التجميل والعطورات | 35.0 | ثانوية | 353002.0 | 650.0 | NaN | 1 | محافظة العاصمة | 2 | متزوج | شركة طيب كنوز الكويت للعطور | 2008-12-17 | 20994137.0 | 1.120048e+11 | 2.0 | 63.4 | 70-79 |
| 1574033 | 275030202722 | 1975-01-01 | 709.0 | الهنــد | 1.0 | ذكر | 3.0 | هندوسي | 33131 | كاشير | PRIVATE | 63102.0 | المطاعم | 45.0 | متوسط | 453501.0 | 600.0 | NaN | 3 | محافظة الاحمدي | 2 | متزوج | شركه مطعم بومباي برياني | 2008-11-05 | 19438088.0 | 4.740391e+11 | 2.0 | 46.4 | 50-59 |
| 1574034 | 272061404147 | 1972-01-01 | 709.0 | الهنــد | 1.0 | ذكر | 3.0 | هندوسي | 53190 | طباخ | PRIVATE | 63102.0 | المطاعم | 45.0 | متوسط | 45.0 | 180.0 | NaN | 3 | محافظة الاحمدي | 2 | متزوج | شركه مطعم بومباي برياني | 2009-02-03 | 19438088.0 | 4.740391e+11 | 2.0 | 49.4 | 50-59 |
| 1574035 | 293042004597 | 1993-01-01 | 709.0 | الهنــد | 1.0 | ذكر | 0.0 | ديانات أخري | 93990 | عامل انتاج | PRIVATE | 63102.0 | المطاعم | 45.0 | متوسط | 453501.0 | 150.0 | NaN | 3 | محافظة الاحمدي | 1 | أعزب | شركه مطعم بومباي برياني | 2017-02-07 | 19438088.0 | 4.740391e+11 | 2.0 | 28.4 | 30-39 |
| 1574036 | 291070116888 | 1991-01-01 | 709.0 | الهنــد | 1.0 | ذكر | 0.0 | ديانات أخري | 99410 | عامل مطعم | PRIVATE | 63102.0 | المطاعم | 45.0 | متوسط | 453501.0 | 120.0 | NaN | 3 | محافظة الاحمدي | 1 | أعزب | شركه مطعم بومباي برياني | 2018-08-29 | 19438088.0 | 4.740391e+11 | 2.0 | 30.4 | 40-49 |
| 1574037 | 287011100278 | 1987-01-01 | 711.0 | ايــران | 1.0 | ذكر | 1.0 | مسلم | 45290 | بائع | PRIVATE | 62187.0 | الأثاث والمفروشات | 45.0 | متوسط | 453501.0 | 450.0 | NaN | 5 | محافظة الفروانية | 2 | متزوج | شركة الكناني استار للاثاث والمفروشات | 2009-09-14 | 19669564.0 | 1.520047e+11 | 2.0 | 34.4 | 40-49 |
| 1574038 | 299010109068 | 1999-01-01 | 110.0 | ســوريا | 1.0 | ذكر | 1.0 | مسلم | 98515 | سائق مركبه خفيفه | PRIVATE | 62187.0 | الأثاث والمفروشات | 45.0 | متوسط | 453501.0 | 200.0 | NaN | 5 | محافظة الفروانية | 2 | متزوج | شركة الكناني استار للاثاث والمفروشات | 2019-04-24 | 19669564.0 | 1.520047e+11 | 2.0 | 22.4 | 30-39 |
| 1574039 | 292040102201 | 1992-01-01 | 107.0 | مصـــر | 1.0 | ذكر | 1.0 | مسلم | 19530 | مترجم | PRIVATE | 71912.0 | مكاتب السياحة والسفر | 35.0 | ثانوية | 353002.0 | 700.0 | NaN | 1 | محافظة العاصمة | 2 | متزوج | شركة لوفيت للسياحه والسفر | 2013-04-28 | 10139428.0 | 1.120048e+11 | 2.0 | 29.4 | 30-39 |